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Preface 



Graph drawing addresses the problem of constructing representations of abstract 
graphs, networks, and hypergraphs. 

The 6th Symposium on Graph Drawing (GD ’98) was held August 13-15, 
1998, at McGill University, Montreal, Ganada. It immediately followed the Tenth 
Ganadian Gonference on Gomputational Geometry (GGGG ’98), held August 
10-12 at McGill. The GD ’98 conference attracted 100 paid registrants from 
academic and industrial institutions in thirteen countries. Roughly half the par- 
ticipants also attended GGGG ’98. As in the past, interaction among researchers, 
practitioners, and students from theoretical computer science, mathematics, and 
the application areas of graph drawing continued to be an important aspect of 
the graph drawing symposium. 

In response to the call for papers and system demonstrations, the program 
committee received 57 submissions, of which 10 were demos. Each submission 
was reviewed by at least 4 members of the program committee, and comments 
were returned to the authors. Following extensive email discussions and multiple 
rounds of voting, the program committee accepted 23 papers and 9 demos. 

GD ’98 also held an unrefereed poster gallery. The poster gallery contained 
16 posters, 14 of which have abstracts in this volume. The poster gallery served 
to encourage participation from researchers in related areas and provided a stim- 
ulating environment for the breaks between the technical sessions. 

In keeping with the tradition of previous graph drawing conferences, GD ’98 
held a graph drawing contest. This contest, which is traditionally a conference 
highlight, serves to monitor and to challenge the state of the art in graph drawing. 
A report on the 1998 contest appears in this volume. 

Many people in the graph drawing community contributed to the success 
of GD ’98. In particular, the authors of submitted papers, demos, and posters 
are due special thanks, as are the members of the program committee and the 
referees it consulted. Much thought and care went into the selection of a high 
quality technical program. 

Very special thanks are due to Steve Robbins, who handled the Web site, the 
electronic submission process, and the equipment for the demos, and to Therese 
Biedl, whose hard work and efficiency were indispensible to the success of the 
conference. She also organized the poster gallery and designed the logos. 

Thanks are also due many people in the McGill community for their contri- 
butions. It was a pleasure to work with volunteers from the newly formed AGM 
student chapter at McGill. These volunteers ran errands and provided staffing 
throughout the conference. Special thanks are due to Albert Mah, who coordi- 
nated the volunteers and contributed many useful suggestions. Frangois Labelle 
arranged publicity announcements. Michael Soss of the GGGG ’98 organizing 
committee provided coordination with that conference. 
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Preface 



Judy Kenigsberg and Lucy St. James of the McGill University School of 
Computer Science staff provided administrative support. Lucy St. James han- 
dled preconference registration. Judy Kenisberg, who staffed the registration 
and information booth, contributed heart and soul to the conference, pulled the 
team together, befriended the participants, and provided excellent staff support 
throughout. 

Thanks are due to the staff of the Redpath Museum, where the conference 
was held, and to the staff of Thomson House, where the lunches were held. Dean 
of Science Alan Shaver, who gave the opening remarks, also provided financial 
backing and suggested the Redpath Museum as a conference venue. 

I wish to thank these people and all the many others who contributed to the 
success of the conference. 

GD ’99 is to be held in the Czech Republic, September 15-19, 1999, under 
the sponsorship of DIMATIA and Charles University, with Jan Kratochvil as 
conference chair. 
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Sue Whitesides 
Program Committee Chair 
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Drawing of Two-Dimensional Irregular Meshes 



Alok Aggarwal^, S. Rao Kosaraju^*, and Mihai Pop^ 



^ IBM Research Division; Solutions Research Center; 
Block 1, Indian Institute of Technology; 

Hanz Khas, New Delhi 110016 

^ Department of Computer Science; Johns Hopkins University; 
Baltimore, Maryland 21218 



Abstract. We present a method for transforming two-dimensional ir- 
regular meshes into square meshes with only a constant blow up in area. 
We also explore context invariant transformations of irregular meshes 
into square meshes and provide a lower bound for the transformation of 
down- st aircases . 



1 Introduction 

We investigate the drawing of any vertically-convex irregular mesh graph, G, 
within a square mesh, G' . The drawing must satisfy the following 2 properties: 

1 . G' must be a planar embedding (no crossings) 

2. edges in G must map into rectilinear paths. 

In other words, the embedding is an orthogonal planar drawing. 

An example of such a drawing is shown in figure [D 




Note that G' is planar and every edge of G maps into a path of length < 3 
in G'. 



* Supported by NSF Grant CCR9508545 and ARC Grant DAAH04-96- 1-0013 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 1998. 

(c) Springer- Verlag Berlin Heidelberg 1998 
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The problem of embedding graphs into orthogonal grids has received consid- 
erable attention. An annotated list of earlier results can be found in More 
recently, several authors have dealt with embedding graphs with maximum de- 
gree 6 into two dimensional grids, achieving grid areas proportional to the square 
of the number of vertices PHEI For special cases better results are known. 
Graphs with maximum degree 6 can be embedded into three dimensional grids 
of volume 0{v}'^) |S|. In the case of trees, two dimensional embeddings of area 
linear in the number of vertices can be obtained uuu 

We address the problem of drawing any vertically-convex mesh (of arbitrary 
aspect ratio) into a square grid (aspect ratio 1) of area 0{n), where n is the 
number of vertices of G. This problem is of great importance in VLSI design. 
Suppose that a function is realized by interconnecting certain existing VLSI lay- 
outs, an example of which is shown in figure El If we enclose the figure within the 
smallest area square (or even a rectangle) then a significant portion of the square 
might be unused. It is advantageous to transform the figure into a square whose 
area is proportional to the original wire area and which does not introduce any 
additional crossings between wires. This problem is solved by our transformation. 




Fig. 2. VLSI layout 



Throughout the paper, when we refer to the area (A) of the grid, we mean 
the footprint area of the mesh, rather than the area of the smallest enclosing 
rectangle. 

In this paper we establish that any vertically-convex mesh of area A can 
be transformed into an 0{A) area square. It was commonly accepted [0| that 
such a transformation is possible even without imposing the vertical-convexity 
constraint. However, we could not establish the general result, and our solution 
to even the specialized class is quite nontrivial. 

We then investigate “context-independent transformations” which must sat- 
isfy the above two properties, and in addition must satisfy an invariance property 
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of Fiat and Shamir We prove that rectangles of area < A can be transformed 
context-independently into 0{A) area squares. We also establish that down- 
staircases (a very special case of vertically-convex meshes) of area A require 
Q{A\ogA) area squares for context-independent transformations. We further 
prove that any context-independent map that maps down-staircases of area A 
into 0{A) area squares must map l7(-\/log A) nodes of some down-staircase into 
a single node of the square. 

2 Terminology 

A histogram is a mesh in the first quadrant such that (0,0) is a node, and, if 
{x, y) is a node, then (a;, y - 1) , (a;, y - 2 ) , . . . , (a;, 0) ; (a; - 1, 0) , (x - 2, 0 ) , . . . , (0, 0) 
are also nodes of the histogram. The set of nodes (0, 0), (1, 0), (2, 0), . . . form 
the base of the histogram (figure Ej). A down- staircase is a histogram in which 
if (x,y) is a node and a; > 0, then (a; — l,y) is also a node of the histogram 
(figure^. An up-staircase is defined similarly (figure EJ. A monotone histogram 
is a down- or up-staircase. A double histogram is a mesh in the first and the 
fourth quadrants in which (0, 0) is a node, and if (x, y) is a node then all the 
grid points between {x,y) and (a;,0) are also nodes of the graph (figure ED . A 
vertically convex mesh is a mesh in which if (a;, yi) and {x, y2) are nodes then all 
the grid points between (a;, yi) and (x, y 2 ) are also nodes of the graph (figure Q. 



3 Transforming Vertically Convex Meshes 

Lemma 1. There exists a constant c such that any rectangle of area A can be 
transformed into a square of area cA 

The proof of the lemma is a simple observation. We can take the rectangle 
and snake it such that the width of the resulting mesh is roughly \[A. Folding 
the rectangle occurs like in figure 0 Finally, we enclose the structure with the 
smallest possible square. It can be easily shown that the wasted space is at most 
a fraction of the area of the mesh. 

Lemma 2. There exist constants ci,C 2 and C 3 such that any down- staircase of 
area A and base length I can be transformed into a rectangle of area ciA -h C 2 P 
and base length C3I. 



Proof sketch: Enclose the down-staircase within another one such that its “height” 
for any x in between 2® -|- 1 and 2®+^ is the same as the height of the given down- 
staircase at 2* -I- 1, as shown in figure 1101 The new down-staircase is of area < 2A; 
it is sufficient to consider its transformation. Let the step heights be do> di, . . . ,dk 

k 

as shown. (The area of the original staircase is A > do di 2^~‘^di while 

z=3 



k 

the area of the new staircase is A' = do di 2^~^di. Hence A' < 2A.) In 

2=2 
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the rest of the proof we consider di to be the height of the vertical slab and 
d', as shown in figure E3 to be the height of the horizontal slab. 





Fig. 3. Histogram Fig. 4. Down-staircase 





Fig. 5. Up-staircase Fig. 6. Double histogram 




Fig. 7. Vertically-convex mesh 



The main idea of the transformation is to fold the new staircase into an almost 
rectangular shape. For this purpose we consider the staircase decomposed into 
horizontal slabs as in figure EJ We start by noticing that if all horizontal slabs 
are “short” we can just enclose the whole staircase in a rectangle of base I and 
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k 



height d'i- We call a slab “short” if d' < 4 • 2*. Under the assumption that 
2—0 

all slabs are short, the area of the rectangle enclosing the whole staircase is 

k k 



^rectangle ^ < 4-2 , 



< 4 • 2^= X) 2* < 8 • 22^= = 

i=l 



8P. 



v’ u’ 



V U 




Fig. 8. Folding a rectangle 



We now describe a method for transforming a down-staircase into a staircase 
in which all horizontal slabs are short. The main idea is illustrated in figure 0 
We fold the “tall” slab in such a way that after folding, the slab still properly 
connects to the two adjacent slabs. 




Fig. 9. Folding a “tall” slab 



In the following we refer to the height of the current slab as h and the width 
as w. If h/2 -I- 2w < I we only fold once, filling as much as possible of the I width 
of the rectangle. The rest of the length is wasted space. If h/2 + 2w > I then 
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we fold the slab such that it fills the whole I and a portion of height h — 21 is 
left. We continue this procedure until the leftover part is “short”. We have thus 
transformed a “tall” slab into a short slab and, possibly, a rectangle of width 1. 
The folding procedure increases the area of the mesh by only a constant factor 
due to the definition of a short slab. Therefore, the rectangular portion has an 
area proportional to that of the original mesh region. We have shown above that 
the “short” slab regions can be wrapped into a rectangle without blowing up 
the area by more than 0{P), thus the total area of the rectangle enclosing the 
folded staircase is A' < c\ A + C2p. 




i=! 




Fig. 10. Staircase 



Lemma 3. There exist constants ci,C2 and C3 such that any histogram of area 
A and base length I can be transformed into a down- staircase of area c\A + 02?^ 
and of base length C3I. 



Proof sketch: All the nodes in the i^^ row are shifted left by i /2 positions or 
until they cannot be shifted, as shown schematically in figure HD A specific 
example is shown in figure o The resulting diagonal lines can be transformed 
into rectilinear paths as shown in figure ESI This results in ci = 4 , C2 = 4 , C3 = 2 . 
As a consequence of Lemmas El and 0 we have: 

Lemma 4. There exist constants ci , C2 and C3 such that any histogram of area 
A and base length I can be transformed into a rectangle of area ciA + C2^^ and 
of base length C3I. 

We now show how to remove the P term from the area. Let a histogram 
consist of p rectangles, the rectangle being U x hi as shown in figure 
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Fig. 11. Conversion of histogram into staircase 




Fig. 12. Example of conversion of his- Fig. 13. Eliminating diagonal lines 
togram into staircase 



Definition: For any c > 0, the histogram is c-skinny if 

p p 

i=l i=l 

Lemma 5. There exist constants ci,C 2 and C 3 such that any histogram of area 
A and length I can be transformed into a ci-skinny histogram of area C 2 A and 
length C3I. 

Proof sketch: Starting from the left, the histogram is partitioned into p sections 
as follows (see figure US). For the i^^ section, h is the minimum length such 
that Ai < If. Then transform the part of A^, above the max{/ii_i, /i^} row by 



^1 








h 




h, 


— 


h 






^2 




k-1 




k-Hl 




P 




L 






1, . 


1 


1 


1 




1 


2 




k-1 


k 


k-Hl 


P 



Fig. 14. Histogram 
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Lemma 0 Let the new height of the section be h'. We can show that 

p p p 

lih'i < c^A and h'^ < c\ kh'^ 

i—1 i—1 i—1 

for some constants ci and C 2 - 

The first inequality immediately follows from Lemma 0 and the requirement 
that Ai < If . The second inequality can also be easily proved. We will actually 
prove a stronger result: h' < cli for some constant c. From the transformation of 
Ai we get that A- = ciAi + C 2 lf and = Csk (according to Lemma0. Since A' is 
the area of a rectangle of height h' and width we can write ciAi+c^lf = czhh^. 
But Ai < If due to the construction and therefore cskh'i < cilf + C 2 lf- We have, 
thus, proved that h' < cU where c = (ci + 02 )! c^- The second inequality is now 
obvious. 

First of all we observe that it may be possible that A > P and there is no 
position for which Ai < If . In this case we can restrict the histogram to the rows 
above height kl, where k is the largest integer such that A > kP. This may blow 
up the area of the lower side of the histogram by at most kP = 0(A). We then 
proceed with the upper side which has area A' < P and are therefore guaranteed 
to be able to split it as described above. 

Second, we restrict the transformation to the section above the max{/ii_i, hi}. 
It is not obvious that the space “wasted” in the lower section is proportional to 
the area of the pieces, nor that the resulting piece is skinny. The proof of this 
property is rather technical and is omitted from this preliminary version. 




Fig. 15 . Partitioning of histogram 



Lemma 6 . There exists a constant ci such that for any constant 02, any C2- 
skinny histogram of area A can be transformed into a rectangle of area C1C2A. 

Proof sketch: Let the histogram be as shown in figure^^ We can assume without 
loss of generality that each hk is a power of 3 (simply enclose each region with 
the smallest rectangle whose height is a power of 3; the area can blow up by 
at most a factor of 3). Among hi, . . . ,hp let hk be a smallest height, and let 
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hk-i < hk+i- Split the Ik length of the Ik x hk rectangle into equal length 
pieces and “fold” them as shown in figure Eni 




1 

k-l h 

k 




1 



k+1 



h 

k-i-1 



Fig. 16. Folding the histogram into a rectangle 



The increase in area can be shown to be < and it results in the removal 

of the hk X Ik rectangle. When we iterate this process until only one rectangle 
remains, we can show that the resulting rectangle satisfies the lemma. 

As a consequence of Lemmas Q El and 0 we have 

Theorem 1. There exists a constant c such that any histogram of area A can 
be transformed into a square of area cA. 

A slightly more complicated construction leads to: 

Theorem 2. There exists a constant c such that any double histogram of area 
A can be transformed into a square of area cA. 



Lemma 7. There exists a constant c such that any vertically- convex mesh of 
area A can be transformed into a double histogram of area cA. 



Proof sketch: Pick any node, ai, on the leftmost column. Draw a rectilinear 
line from a\ until the rightmost column is reached such that each horizontal 
line segment (is left to right and) is as long as possible, and each vertical line 
segment is as short as possible. Then transform the vertically-convex mesh into 
a mesh in which the chosen line is horizontal. The idea of the transformation is 
shown in figure All diagonal lines are embedded onto a grid of suitable size 
in a manner similar to the approach in figure II ,'11 

As a consequence of Lemma Q and Theorem 0 we have: 

Theorem 3. There exists a constant c such that any vertically convex mesh of 
area A can be transformed into a square of area cA. 
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Fig. 17. Transforming a vertically-convex mesh into a double histogram 
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Fig. 18. Idea of transformation 



4 Context-Invariant Transformations 

A planar transformation from a class of graphs into a square is said to be context- 
invariant if every 2 nodes, that are in 2 different graphs of the class, that have 
identical coordinate values w.r.t. the axes, map into the same node of the square. 

We saw in Lemmas that rectangles of area A can be transformed into 0{A) 
area square. However the transformation is not context-invariant. We show in 
the next theorem that context-invariant transformations exist for rectangles. 
Fiat and Shamir 0] gave a nice context-invariant mapping of rectangles of area 
< A into a 0{A) area square. (Their final shape is not an exact square but that 
is not a critical factor). However their mapping results in a non-planar graph. 

Consider all the rectangles of area < A drawn in the first quadrant such that 
2 sides of each rectangle coincide with the 2 axes. 

Theorem 4. There exists a constant c such that any rectangle of area < A can 
be transformed context-invariantly into a square of area cA. 

Every rectangle of area < A is a sub-rectangle of one of the log A rectangles 
shown in figure II HI Hence it is sufficient to consider the transformation of the 
log A rectangles. Observe that any 2 nodes (e.g. P and Q) which cannot be part 
of the same rectangle can map into the same node of the square. We fold the 
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Fig. 19. Staircase containing all rectangles of area < A 



1 X A rectangle at the a-line, and then fold the resulting 2 x (^ -|- 1) rectangle 
at the /3-line, and so on. We stop the folding when the base length is about \J~A. 
We similarly fold the rectangles of base length > \[A toward the y-axis. Note 
that for every two rectangles the overlapping area is folded at the same time, 
thus insuring the context invariance property. 

We can show that these steps form a proper transformation. We might ask 
whether the result extends to down-staircases. 

Theorem 5. Any context-invariant transformation from down-staircases of area 
A requires an f2{AlogA) area square. 




Fig. 20. Transformation from down-staircases of area < A requires an 
Q{A\ogA) square 
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Proof sketch: No two nodes in figure 1201 can map into the same node of the 
square since any two nodes in it can belong to the same down-staircase of area 

A/2 

A. Hence, the minimum area of the square is ^ = f^{AlogA). 

Theorem 6. Any context-invariant mapping from down-staircases to 0{A) area 
squares results in f2{\/log A) distinct nodes of a down-staircase of area A being 
mapped into the same node of the square. 

Proof sketch: There are f2{A^f log A) distinct points (a;, y) such that xy < 

A 

(since the area of the staircase is -y=^l > ^ log A = A^flogA.) Out of 

i 

these Q{Ay/\ogA) points, f2{y/logA) points get mapped into the same point of 
the square (since the area of the square is 0(A).) There exists a down-staircase 
of area A which includes any y/logA such points. 



5 Achieving Bounded Path Lengths 



Throughout the paper we have not imposed any constraints on the lengths of 
the paths connecting the original vertices in the embedding. Using the approach 
from figure 0 the paths can have a length proportional to the width of the mesh. 

However the fold can be corrected by keeping the right column edge lengths 
(like (u, u')) at unity and by increasing the lengths of the first i edges of the 
column (from the right) to 10 as shown in figure ETI ldue to a minor technicality 
some of the lengths become 11). 

In the following paragraphs we detail the folding procedure. The main idea 
is to correct the folding scheme of figure 0 We will fold the vertical paths in the 
exact way presented in the figure. The horizontal paths, however, will gradually 
be folded from the vu position to the v'u' position. 

The first step is to quadruple the area of mesh, replacing each original mesh 
square with four squares the same size as the original. This enables us to connect 
vertices of the mesh that are not situated on the same row or column. We consider 
the “mesh” -coordinates of each point in the original mesh to be (i,j), where i 
is the horizontal coordinate, starting from 0 in the rightmost column, and j is 
the vertical coordinate, starting from 0 in the row at which we start to fold the 
mesh. Thus, row 0 is unchanged by the fold, while row 1 is the first row that 
gets modified. 

In the folded mesh, the j coordinate is the distance from the fold along the 
path on which the original vertical edges were folded (lines uu',vv' in figure EJ. 
We denote the new coordinates of (i,j) in the folded mesh by (i' ,j')- Due to the 
doubling of the grid, i' = 2i. In order to achieve a smooth transition in the fold 
j' is described by the following formula: 



./ 

J = 



lOj j < i 
8i -\- 2j otherwise 



( 1 ) 
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Fig. 21. Actual folding 



Whenever lOj = 2i or lOj = 6i -I- 2 (vertices 65 and C3 in figure It! Ill the 
vertices would overlap with corners of the folding path. In order to avoid this 
case we need to subtract 1 from j' . 

It is obvious that the edge lengths along the vertical paths are constant (the 
mapping function is linear in j). We only need to show that the horizontal paths 
are of constant length too. It is easy to see that, for any j, and for all i < j, the 
Manhattan distance along the grid between {i' ,j') and — l,f) is constant and 
equal to 2. This situation occurs in the stable position after the fold. The folded 
vertical paths “gain” a length of 4 at every 90° turn. This means that path i 
needs to be 4 units longer to reach the same position as path i — 1 (i.e. after 
the first turn in figure EH the ends of the paths are vertically aligned). After 
two 90° turns the “distance” between adjacent paths increases to 8, situation 
captured by the 81 -I- 2j term in equation ^ Since the original distance between 
two vertices is equal to 2, and all vertices are equally spaced on the vertical paths 
during the fold (the “vertical” distance is 10) the distance between two adjacent 
vertices and (i' ± l,j')) is 2 -|- 4 • number of turns. After two turns the 

distance is not longer than 10. The adjustment, made in order to prevent vertices 
from lying on the corner of a vertical path, only modifies the distance between 
adjacent vertices by 1 unit (e.g. the distance between bg and 65 is 3 instead of 2 
and between 65 and C5 is 11 instead of 10 in figure HJ. Therefore, the length of 
the paths in the folded mesh is still constant. 

Similarly we can perform a 90° fold. The equation for j' is: 



•/ 



J 



6 j j < i 

Ai + 2j otherwise 



( 2 ) 
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It follows immediately from the discussion on the 180° fold that the 90° fold 
only increases path lengths by a constant amount. 

Using the 90° and 180° turn operators we can insure that the path lengths 
increase only by a constant factor in some of the results presented in the paper. 
All results, up to lemma 0 inclusively, hold when the two operators are used to 
insure a constant blow-up in path lengths. For the remaining part of the paper 
we believe the bound on path lengths holds, however we have not been able to 
examine all technical details yet. The main difficulty stems from the need to fold 
the same region several times. The straightforward approach requires a doubling 
of the mesh dimensions at each fold. 
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Abstract. In this paper we introduce the quasi-upward planar drawing 
convention and give a polynomial time algorithm for computing a quasi- 
upward planar drawing with the minimum number of bends within a 
given planar embedding. Further, we study the problem of computing 
quasi-upward planar drawings with the minimum number of bends of 
digraphs considering all the possible planar embeddings. The paper con- 
tains also experimental results about the proposed techniques. 



1 Introduction 

An upward drawing of a digraph is a drawing such that all the edges are repre- 
sented by curves monotonically increasing in the vertical direction. A digraph is 
upward planar if it has a planar upward drawing. 

Planar upward drawings have been deeply investigated and several theoret- 
ical and application-oriented results can be cited in this intriguing field. What 
follows is a limited list containing a few examples (a survey on upward planarity 
can be found in cni). Upward planarity of specific families of digraphs has been 
studied in: planar st-digraphs 0|7], embedded and triconnected digraphs 0, 
single source digraphs [IS SI, bipartite digraphs 0, outerplanar digraphs ini, 
trees ES, and hierarchical digraphs ES The NP-completeness of upward pla- 
narity testing is proved in ||. Further, an impressive set of results on upward 
drawings can be found in the literature on ordered sets. 

Despite such a long list of results, upward planar drawings have found limited 
applicability. The reasons for this are mainly in the tightness of the upward 
planar standard that can be satisfied for “a few” digraphs. Also, the applications 
require very often a similar but slightly different standard, where the drawing is 
upward “as much as possible”. This is the case, for example, of Petri Nets or of 
certain types of SADT diagrams. 

In this paper we introduce and investigate quasi-upward planar drawings. A 
quasi-upward drawing T of a digraph is such that the horizontal line through 
each vertex “locally splits” the incoming edges from the outgoing edges. More 
formally, for each vertex v there exists a connected region R of the plane, properly 

* Research supported in part by the ESPRIT LTR Project no. 20244 - ALCOM-IT 
and by the CNR Project “Geometria Computazionale Robusta con Applicazioni alia 
Grafica ed al CAD.” 
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containing v, such that, in the intersection of R with F, the horizontal line 
through V separates the incoming edges from the outgoing edges. Examples of 
quasi-upward (planar) drawings of Petri Nets are shown in Fig. ^ 




Fig. 1. Two quasi-upward drawings of Petri Nets. 



Observe that an upward drawing is quasi-upward and that, while an upward 
drawing requires the acyclicity of the digraph, any digraph can be drawn quasi- 
upward. 

In a quasi-upward drawing we call bend a point on an edge where the tangent 
moves from the interval 0, tt to the interval tt, 2tt or viceversa. In other words, a 
bend is a point on an edge where the edge is tangent to a horizontal line. The 
quasi-upward drawings of Fig. da and Fig. db have 2 and 8 bends, respectively. 
An upward drawing is a quasi-upward drawing with 0 bends. 

The main contributions of this paper are summarized as follows: we introduce 
the quasi-upward planar drawing convention (Section Eli we give a polynomial 
time algorithm for computing a quasi-upward planar drawing with the minimum 
number of bends of an embedded “bimodal” digraph and show how to extend the 
technique to deal with non-bimodal digraphs; we use a min-cost flow technique 
that unifies the techniques for orthogonal drawings presented in with those 
presented in |3| for upward planarity (Section OJ. 

Motivated by the practical applicability of quasi-upward planar drawings we 
study the problem of computing quasi-upward planar drawings with the min- 
imum number of bends of digraphs considering all the possible planar embed- 
dings. Thus, we present: lower bounds techniques for quasi-upward planar draw- 
ings (Section Ej); a branch-and-bound algorithm for computing a quasi-upward 
planar drawing with the minimum number of bends of a biconnected digraph 
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(Section 0. Such a technique is a variation of the technique presented in 0 
for orthogonal drawings and can be used for each biconnected component of a 
digraph, constituting the basis of a powerful drawing heuristic. Further, it al- 
lows to test if the digraph is upward planar. An implementation of the above 
branch-and-bound algorithm and the results of experiments performed on a test 
suite of 300 biconnected digraphs with number of vertices in the range 10 — 200. 
The experiments show a reasonable time performance in the selected range. 
(Section 0). 

2 Preliminaries 

We assume familiarity with planarity and connectivity of graphs m- Since we 
consider only planar graphs, we use the term embedding instead of planar embed- 
ding. The following definitions, usually introduced for graphs, are used here for 
digraphs. Let G be a biconnected digraph. A split pairoi G is either a separation- 
pair or a pair of adjacent vertices. A split eomponent of a split pair {it, f } is either 
an edge (u, v) or a maximal subgraph G of G such that G contains u and v, and 
{u, ?;} is not a split pair of G. A vertex w distinct from u and v belongs to ex- 
actly one split component of {u, z)}. Suppose Gi, . . . , Gk are some pairwise edge 
disjoint split components of G with split pairs ui,vi . . .Uk,Vk, respectively. The 
digraph G' C G obtained by substituting each Gi {i = 1, . . . ,k) with any simple 
path Pi between Ui and Vi in Gi is a partial digraph of G. Paths pi are called 
virtual paths. We denote the set of the edges of the virtual paths of G' and 
we denote the set of the edges of G' that are not in the virtual paths. 

We say that Gi is the pertinent digraph of pi and that pi is the representative 
path of Gi . 

Let (j) be an embedding of G and (j)' an embedding of G' . We say that (p 
preserves p' if G(^, can be obtained from G^f, by substituting each component Gi 
with its representative path. 

In the following we revise SPQR-trees 0, with the purpose to use them 
to decompose digraphs instead of graphs. SPQR-trees are closely related to the 
classical decomposition of biconnected graphs into triconnected components m 
Let {s,t} be a split pair of G. A maximal split pair {u,v} of G with respect to 
{s,t} is a split pair of G distinct from {s,t} such that for any other split pair 
{u' ,v'} of G, there exists a split component of {u' ,v'} containing vertices u, 
z), s, and t. Let e{s,t) be an edge of G, called reference edge. The SPQR-tree 
T of G with respect to e describes a recursive decomposition of G induced by 
its split pairs. Tree T is a rooted ordered tree whose nodes are of four types: 
S, P, Q, and R. Each node /i of T has an associated biconnected multigraph 
containing directed and undirected edges, called the skeleton of /z, and denoted 
by skeleton{fjL) . Also, it is associated with an edge of the skeleton of the parent 
V of /i, called the virtual edge of /i in skeleton(y) . Tree T is recursively defined 
as follows. If G consists of exactly two parallel edges between s and t, then T 
consists of a single Q-node whose skeleton is G itself (trivial case). If the split 
pair {s, t} has at least three split components Gi, • • • , Gfc (A: > 3), the root of T is 
a P-node /i. Graph skeleton{p) consists of k parallel undirected edges between s 
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and t, denoted ei, • • • , efc, with ei = e. Otherwise, the split pair {s, t} has exactly 
two split components, one of them is the reference edge e, and we denote with 
G' the other split component. If G' has cutvertices ci,---,Cfc-i {k > 2) that 
partition G into its blocks Gi, • • • , Gk, in this order from s to t, the root of T is 
an S-node fj,. Graph skeleton(^) is the cycle of undirected edges eo,ei, • • • ,Cfe, 
where Cq = e, cq = s, Ck = t, and Cj connects Cj_i with Ci (t = 1 • • • fc). If none 
of the above cases applies, let {si, ti}, • • • , {sk, tk} be the maximal split pairs of 
G with respect to {s, t} {k > 1), and for z = 1, - ■ ■ ,k, let Gi be the union of all 
the split components of {si^U} but the one containing the reference edge e. The 
root of T is an R-node /i. Graph skeleton{^) is obtained from G by replacing 
each subgraph Gi with the undirected edge Cj between Si and ti. 

Except for the trivial case, /i has children ^i, • • • , /ij, in this order, such that 
Hi is the root of the SPQR-tree of graph Gi U with respect to reference edge 
Ci (z = 1, - ■ ■ ,k). Edge is said to be the virtual edge of node Hi in skeleton^n) 
and of node h in skeleton{Hi) ■ Digraph Gi is called the pertinent digraph of node 
Hi, and of edge Cj. 

The tree T so obtained has a Q-node associated with each edge of G, except 
the reference edge e. We complete the SPQR-tree by adding another Q-node, 
representing the reference edge e, and making it the parent of h so that it 
becomes the root. Let ^ be a node of T. We have: if h is an R-node, then 
skeleton{H) is a triconnected graph; if h is an S-node, then skeleton{H) is a cycle; 
if /z is a P-node, then skeleton{H) is a triconnected multigraph consisting of a 
bundle of multiple edges; and if /i is a Q-node, then skeleton{H) is a biconnected 
multigraph consisting of two multiple edges. The SPQR-trees of G with respect 
to different reference edges are isomorphic and are obtained one from the other 
by selecting a different Q-node as the root. Hence, we can define the unrooted 
SPQR-tree of G without ambiguity. The SPQR-tree T of a digraph G with n 
vertices and m edges has m Q-nodes and 0{n) S-, P-, and R-nodes. Also, the 
total number of vertices of the skeletons stored at the nodes of T is 0{n). 

By applying the above definitions, the skeletons of the nodes of T contain 
both directed and undirected edges. To avoid this, we modify T as follows. For 
each node /i, each virtual edge (u, v) of skeleton{H) is replaced by any simple 
path {virtual path) of the pertinent digraph of {u,v) between u and v. Also, we 
call reference path the path that substitutes the reference edge. Observe that, 
after this modification, skeleton{H) is directed and is a subgraph of G. 

An SPQR-tree T rooted at a given Q-node represents all the planar embed- 
dings of G having the reference edge (associated to the Q-node at the root) on 
the external face. 

3 Quasi-Upward Planarity 

Let G be an embedded planar digraph. A vertex of G is himodal if its incident list 
can be partitioned into two possibly empty linear lists one consisting of incoming 
edges and the other consisting of outgoing edges. If all its vertices are bimodal 
then G and its embedding are called himodal. A digraph is himodal if it has a 
planar bimodal embedding. 
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We define the operation insert-switch on an edge of an embedded digraph. 
Operation 2fc-insert-switch on edge (u, v) removes (u, v) and inserts vertices 
vi,ui,V2,U2,...,Vk,Uk and edges (u,vi), (ui,vi), (ui,r’2), (m 2, t'2), • ■ • , (wfc, f)- 
See Fig. 0 The new path is embedded in place of {u,v). Observe that vertices 
u\, . . . ,Uk {v\, . . . , Vk) are k new sources (sinks) of the digraph. 



O — >0 O — — ■ — <D 

U V U Vi U 1 V2 U2 V 



Fig. 2. An example of 4-insert-switch. 



Let G be a bimodal digraph that is not upward planar and let G' be a digraph 
obtained from G by performing one 2fc-insert-switch operation on (u,v). Sup- 
pose that G' is upward planar (and, of course, acyclic) and consider an upward 
drawing F' of G'. Reverse the direction of edges (iti, ui), (^ 2 ,^ 2 ), . . . , (uk,Vk) of 
T'. We have that F' is a quasi-upward drawing with 2k bends of G, where edge 
{u, v) of G is “represented” by the drawing of path vi,ui,V 2 ,U 2 , ■ ■ ■ ,Vk,Uk of G'. 
Each of vi,ui,V 2 ,U 2 , . . . ,Vk,Uk corresponds to a bend. 

In Fig. 0a we show a bimodal digraph that is not upward planar. Observe 
that if we perform a 2-insert-switch operation on edge (3,2), then the resulting 
digraph becomes upward planar. See Fig. 0b. Observe that the resulting draw- 
ing can be simply modified into a quasi-upward planar drawing of the original 
digraph (Fig. 0c) by reversing one edge. The best aesthetic results are obtained 
by smoothing the bends (Fig. 0d). 

We consider the following problem. Given a bimodal digraph G we want to 
determine a quasi-upward planar drawing of G with the minimum number of 
bends. In this section we restrict our attention to a given planar embedding, 
while in the next sections we shall consider also the possibility of changing the 
embedding. 

Let G be an embedded bimodal planar digraph and let S (T) be the set of its 
sources (sinks). Let / be a face of G. We visit the contour of / counterclockwise 
(i.e. such that the face remains always to the left during the visit). Let 2n/ be 
the number of pairs of consecutive edges ei, 62 such that the the direction of ei 
is opposite to the direction of 62 - The capacity c/ of / is nj — 1 if / is an internal 
face and ny -|- 1 if / is the external face. 

Lemma 0 and Theorem 0 have been shown in 0. 

Lemma 1. The sum of the capacities of all the faces is equal to 151 -I- \T\. 

An assignment of the sources and sinks of G to the faces such that the 
following properties hold is upward consistent. (1) a source (sink) is assigned to 
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(a) 



(b) 




(c) 



(d) 



Fig. 3. (a) A non upward planar digraph; (b) An upward planar drawing after a 
2-insert-switch operation; (c) A quasi-upward planar drawing; and (d) A quasi- 
upward planar drawing with smoothed bends. 



exactly one of its incident faces; for each face /, the number of sources and sinks 
that are assigned to / is equal to c/. 

Theorem 1. Let G be an embedded bimodal digraph; G is upward planar if and 
only if it admits an upward- consistent assignment. 

Consider again Fig. 0 If we interpret the 2-insert-switch in terms of capacity 
and assignment we have that before the insertion (Fig. 0a) the capacity of the 
external face h was equal to 2 while the capacity of all the internal faces was 
equal to 0. Also, the external face had one source (vertex 0) to accomodate while 
face / had one sink (vertex 4). Hence, we had a surplus of capacity on h and 
a deficiency of capacity on /. The effect of the insertion was to increase the 
capacity of both / and h of one unit. At the same time we have now two more 
sources and sinks to assign. However, such vertices can be both assigned to h. 

A maximal path whose edges share two (not necessarily distinct) faces is a 
boundary path. 

We extend the concept of upward consistent assignment, that has been used 
for upward planarity, by introducing a new flow network that models quasi- 
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upward planarity of a digraph within a given planar embedding. Namely, each 
quasi-upward planar drawing corresponds to a flow in the network. Further, each 
flow corresponds to an equivalence class of quasi-upward planar drawings. For 
each element of the equivalence class we have the same number of bends along 
each boundary path. Also, for each sink t, consider the horizontal line A through 
t and a sufflciently small region R properly enclosing t; the intersection between 
R and the halfplane “above” A is a subset of the same face for each drawing of 
the class; and for each source s, consider the horizontal line A through s and a 
sufficiently small region R properly enclosing s; the intersection between R and 
the halfplane “below” A is a subset of the same face for each drawing of the 
class. 

The flow network N associated to an embedded bimodal digraph G is deflned 
as follows: nodes have supplies and demands; arcs have a capacity (3 and a cost 
X; the nodes of N are the sources, the sinks, and the faces of G; a source or sink 
node V produces a flow a{v) = 1; a face-node / consumes a flow a{f) = Cf, where 
c/ is the capacity of /. Observe that if / is internal and is a directed cycle then 
Cf = —1. This corresponds to a production of flow rather than a consumption; 
for each vertex-node v we have one arc from v to every its incident face /. Such 
arcs have zero cost and capacity (3{v, /) = 1; for each boundary path between 
faces / and g we have arcs (/, g) and {g, /). Arcs have cost %(/, g) = x{g, /) = 2 
and a capacity P{f,g) = /3{g, /) = -koo. 

By Lemma D it follows that the sum of the amounts of flow supplied by 
the vertex-nodes is equal to the sum of the amounts of flow consumed by the 
face-nodes. Considering that arcs between face-nodes have infinite capacity we 
have: 

Lemma 2. Network M admits a feasible integer flow. 



Theorem 2. Let G be an embedded bimodal digraph and let Af be the associated 
flow network. For each feasible integer flow a ofJV there is a quasi-upward planar 
drawing F of G, and for each quasi-upward planar drawing F of G there is a 
feasible integer flow a of Af such that: 

1. The number of bends of F is equal to the cost of a. 

2. The number of bends in F along each boundary path between faces f and g 
is equal to %(/, g)a{f, g) + x{g, f)cr{g, f) ■ 

3. for each sink t, let f be the face such that a(t, f) = 1; consider in F the hor- 
izontal line A through t and a sufficiently small region R properly enclosing 
t; the intersection between R and the halfplane “above” A belongs to f; and 

4-. for each source s, let f be the face such that tj(s, /) = 1; consider in F 
the horizontal line A through s and a sufflciently small region R properly 
enclosing s; the intersection between R and the halfplane “below” A belongs 
to f. 

Proof. Intuitively, k units of flow from face / to its adjacent face g through the 
arc (/, g) associated to boundary path p (shared by / and g) represent a 2fc-insert 



22 



Paola Bertolazzi, Giuseppe Di Battista, and Walter Didimo 



switch operation performed on any edge e of p. The flow cr is interpreted as an 
assigment of sources and sinks to faces. The set of sources and sinks includes 
the new sources and sinks introduced by the insert switch operation. The 2k 
new sources and sinks introduced on e from the 2/c-insert-switch operation are 
assigned to face g. An analysis based on the effect of the insert switch opera- 
tions on the capacity of the faces shows that the resulting assignment is upward 
consistent. 

Theorem allows to reduce the problem of finding a quasi-upward planar 
drawing of an n- vertex embedded bimodal digraph G with the minimum number 
of bends to a minimum cost flow problem. The flow problem can be solved in 
time 0(T(n)), where T{n) is equal to 0{n^logn), using a standard technique 
(see e.g. ^). The T{n) bound can be reduced to O^'nJ^^y/logn) with a technique 
presented in m 

If the digraph is not bimodal, a planarization technique can be used in a 
preprocessing step, where dummy vertices are introduced into the digraph to 
represent crossings. 

Once the min-cost-flow problem has been solved a planar st-digraph including 
the given digraph can be found in time 0(n + b), where b is the number of bends, 
with a technique similar to the one presented in P| to “saturate” the faces of an 
upward planar embedding. The enclosing planar st-digraph can be drawn with 
any of techniques for drawing planar st-digraphs (see 0) and, eventually, the 
bends are smoothed. 



An example of drawing constructed with the algorithm presented in this 
section is shown in Fig. 0 




Fig. 4. A quasi-upward planar drawing 
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4 Lower Bounds on the Number of Bends of Quasi-Upward 
Planar Drawings 

Let G = (V, E) be a biconnected bimodal digraph and F be a quasi-upward 
planar drawing of G, we denote by b(E) the total number of bends of E and by 
bE'{r) the number of bends along the edges oi E' {E' C E). Let Gi = {Vi, Ei), 
i = 1, . . . ,khe k subgraphs of G such that EiHEj = j, and = E. 

Let Ei be an optimal quasi-upward planar drawing of Gi. We have: 

Property L b{E) > ^(U)- 

Let G'^, be an embedded partial digraph of G with respect to the split com- 
ponents Gi, . . . ,Gk- 

Let E^, be a quasi-upward planar drawing of G'^, . Suppose E^, is such that 
b^nonvirt {E^,) is minimum. Consider an embedding ^ of G that preserves (j)' and 
an optimal quasi-upward planar drawing E^ of G^. 

Lemma 3. 6^;non„irt(r'^,) < 6^;non«irt (C^). 

Proof. Suppose, for a contradiction, that bE^onvin {E'^,) > {E^). For each 

component Gi of G, the virtual path pi is represented in E^ by a polygonal line. 
We can derive from E,p a quasi-upward planar drawing of G^, by simply 
substituting the quasi-upward planar drawing of Gi with pi, for each Gi. It is 
easy to see that 6£;non„irt(F0/) = bE^onvi,.t{E^). Hence, E has less bends along 
the edges that do not belong to virtual paths than E'^,, a contradiction. 

From Property n and Lemma 01 it follows a first lower bound. 

Theorem 3. Let G = {V,E) be a biconnected bimodal digraph and G'^, an em- 
bedded partial graph of G. For each virtual path pi of G' , i = 1, ... ,k, let bi 
he a lower bound on the number of bends of any quasi-upward planar drawing 
of the pertinent digraph Gi of pi. Consider a quasi-upward planar drawing E^, 
of G'^, such that 6^;nont.irt (F^, ) is minimum. Let (j) be an embedding of G that 
preserves 4>' , consider any quasi-upward planar drawing E^ ofG^. We have that: 

b{E^) > bE^or,virt{E'^,) + Yl,i=i,...,k^i- 

A quasi-upward planar drawing F^/ of G^i such that (F^,) is min- 

imum can be easily obtained by using the algorithm presented in Section 01 
Namely, when two faces / and g share a virtual path, the corresponding edge of 
Af in the minimum cost flow problem is set to zero. 

A further lower bound is described in the following property. 

Property 2. Let G^ be an embedded bimodal digraph and G^, an embedded 
partial digraph of G, such that (j) preserves (jf . Consider an optimal quasi-upward 
planar drawing F^, of G'^, and a quasi-upward planar drawing F^ of G^. Then 
we have that: ^(F^) > b{E^,) 

The next theorem allows us to combine the above lower bounds into a hybrid 
technique. 
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Theorem 4. Let be an embedded biconnected bimodal digraph and G'^, an 
embedded partial graph of G. Consider a subset of the set of the virtual 

paths of G'^i. Denote by Ep the set of edges of For each virtual path 

Pj ^g ^ lower bound on the number of bends of the pertinent 

graph Gj of pj . Consider a quasi-upward planar drawing F'^, of G'^, , such that 
bpnonvi-rt{F'^i) + bEF{r^') minimum. Let be a quasi-upward planar drawing 
of Gcj), we have that: b{r^) > bpn onvirt i.^4,1) + + Hj-.p^^pvirt bj 

5 Computing Optimal Drawings with Branch and Bound 
Techniques 

Let G be a biconnected bimodal digraph. We describe a technique for enumer- 
ating all the possible quasi-upward planar drawings of G and strategies to avoid 
examining all of them in computing a quasi-upward planar drawing with the 
minimum number of bends. Such a technique is a variation of the one presented 
in PI for orthogonal drawings. 

The enumeration uses the SPQR-tree T of G. Namely, we enumerate all the 
quasi-upward planar drawings of G with edge e on the external face by rooting T 
at e and exploiting the capacity of T rooted at e in representing all the embed- 
dings having e on the external face. A complete enumeration is done by rooting T 
at all the possible edges. Actually, in general, T represents also the embeddings of 
G that are not bimodal. To solve this problem, before computing T, we perform 
an “expansion” on all the vertices of G with more than one incoming or outgoing 
edge. The expansion replaces vertex v with its incoming edges (ui,v ), . . . , {uh, v) 
with h > 1 and its outgoing edges (u, wi), . . . , (v,Wk) with k > 1 with vertices 
vi,V2 and edges (ui,ui), . . . , (uh,vi), (ui,U2), (t’2, wi), • ■ • , (t’2, Wfc)- We call edge 
(ui,U2), that represents vertex v, straight edge. Now, the edges incident on v 
must enforce the bimodal constraint in any computed embedding. Observe that 
the straight edges are boundary paths. 

We encode the embeddings of G as follows. We visit T in such a way that a 
node is visited after its parent, e.g. depth-first or breadth- first. This induces a 
numbering 1 , . . . , r of the P- and R-nodes of T. We define an r-uple of variables 
X = xi, . . . , Xr that are in one-to-one correspondence with the P- and R-nodes 
fix, ..., Hr of T. Each variable Xi of X that corresponds to an R-node pii can be 
set to three values corresponding to two swaps of the pertinent digraph of jn 
plus one unknown value. Each variable Xj of X that corresponds to a P-node 
Hj with degree k (including the reference edge) can be set to up to (fc — 1)! 
values corresponding to the possible permutations of the pertinent digraphs of 
the children of pj plus one unknown value. Unknown values represent portions 
of the embedding that are not yet specified. 

A search tree B is defined as follows. Each node P of B corresponds to a 
different setting Xp of X . Such a setting is partitioned into two contiguous (one 
of them possibly empty) subsequences X\, . . . ,Xt and Xh+i, . . . , Xr. Elements of 
the first subsequence contain values specifying embeddings, while elements in 
the second subsequence contain unknown values. The leaves of B are in corre- 
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spondence with settings of X with no unknown values. Internal nodes of B are 
in correspondence with settings of X with at least one unknown value. The set- 
ting of the root of B consists only of unknown values. The children of [3 (with 
subsequences x\, . . . ,Xh and Xh+i, ■ ■ ■ , Xr) have subsequences x\, . . . , Xh+i and 
Xh+ 2 , • ■ • ) Xr, one child for each possible value of Xh+i- 

Observe that there is a mapping between the embedded partial digraphs of 
G and the nodes of B. Namely, the embedded partial digraph Gp of G associated 
to node (3 oi B with subsets x\,. . . ,Xh and Xh+i, ■ ■ ■ ,Xr is obtained as follows. 
First, set Gp to skeleton{^i) , embedded according to x\. Second, substitute each 
virtual path pi of with the skeleton of the child pi of pi, embedded accord- 
ing to Xi, only for 2 < i < h. Then, recursively substitute virtual paths with 
embedded skeletons until all the skeletons in {skeleton{pi ) , . . . , skeleton{ph)} 
have been used. 

We visit B breadth- first starting from the root. At each node P of B with 
setting Xp we compute a lower bound and an upper bound of the number of 
bends of any quasi-upward planar drawing of G such that its embedding is 
(partially) specified by Xp. The current optimal solution is updated accordingly. 
The subtree rooted at /3 is not visited if the lower bound is greater than the 
current optimum. 

For each /3, lower bounds and upper bounds are computed as follows. 

1. We construct Gp by using an array of pointers to the nodes of T. 

2. We compute lower bounds by using the results presented in Section 0 
Gjs is a partial digraph of G, with embedding derived from Xp. Let 
^pjnonvirt^ be the set of edges (not) belonging to the virtual paths of Gp. 
For each virtual path pi of Gp consider the pertinent digraph Gi of pi and 
compute a lower bound bi on the number of bends of Gi . Denote by 

the set of virtual paths pi such that bi = 0 and hy Ep the set of edges of 
such paths. 

We apply the algorithm presented in Section 0 on Gp, assigning zero costs 
to the arcs of Af associated to the virtual paths of Gp that are not in 
In order to have no bends on the straight edges we set the cost of the cor- 
responding arcs to infinity. Observe that after this setting the flow problem 
remains feasible. In fact, the straight edges cannot cause cycles in any partial 
digraph. We obtain a quasi-upward planar drawing Fp of Gp with minimum 
number of bends on the set U Ep. Let b{Fp) be such a number of 

bends. Then, by Theorem 0 we compute the lower bound Lp at node P, as 
Lp = b{Fp) + J2i-.pi^P'>"t 

Lower bounds bi can be pre-computed with a suitable pre-processing by 
visiting T bottom-up. The pre-computation consists of two phases, (a) We 
apply the algorithm presented in Section 0 on the skeletons of each R- and 
P-node, with zero cost for the arcs associated to the virtual paths; in this 
way we associate a lower bound to each R- and P-node of T. Note that these 
pre-computed bounds do not depend on the choice of the reference edge, so 
they are computed only once, at the beginning of the computation, (b) We 
visit T bottom-up summing for each node p the lower bounds of the children 
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of /i to the lower bound of /i. Note that these pre-computed bounds depend 
on the choice of the reference edge, so they are re-computed at any choice 
of the reference edge. 

3. We compute upper bounds. Namely, we consider the embedded partial di- 
graph G /3 and complete it to a pertinent embedded digraph Gif,. The em- 
bedding of is obtained by substituting the unknown values of Xj^ with 
embedding values in a random way. Then we apply the algorithm presented 
in Section 0 to so obtaining the upper bound. We also avoid multiple 
generations of the same embedded digraph in completing the partial digraph. 

A further speed-up of the branch-and-bound technique can be obtained by 
suitably choosing the paths used as virtual paths in the skeletons of the nodes 
of T. We call switches the vertices of a simple path where the direction of the 
edges changes. Observe that, in general, a boundary path with a few switches 
has less degrees of freedom in a quasi-upward planar drawing than a path with 
many switches. Hence, to have tighter lower bounds it is a good choice to select 
as virtual paths those that have a minimum number of switches. The feasibility 
of this approach is guaranteed by the following theorem. 

Theorem 5. Let G he a digraph with n vertices and m edges and let u and v be 
two distinct vertices ofG. A simple path with the minimum number of switches 
between u and v can be computed in 0{n + m) time. 

Proof. An algorithm that works in 0{n + m) time computes a sequence of depth- 
first-search, “moving from u to and alternating in considering the direction 
of the edges. 

6 Experimental Results 

The algorithm presented in Section 0 has been implemented and extensively 
tested. It constitutes a fundamental “subroutine” for the implementation of the 
algorithm presented in Section 0 for biconnected digraphs. We have tested such 
branch and bound algorithm against a randomly generated test suite consisting 
of about 300 digraphs. The test suite is available on the Web and has been gen- 
erated as follows (www.dia.uniroma3.it/people/gdb/wpl2/ L0G.html). Any 
embedded planar biconnected graph can be generated from the triangle graph 
by means of a sequence of insert-vertex and insert-edge operations. Insert- vertex 
subdivides an existing edge into two new edges separated by a new vertex. Insert- 
edge inserts a new edge between two existing vertices that are on the same 
face. We have implemented a generation mechanism that at each step randomly 
chooses which operation to apply and where to apply it. After the generation, 
edges are randomly oriented, and then the digraph is discarded if it is not bi- 
modal. The density (number of edges over number of vertices) of the generated 
graphs is in the range 1.2-2. 

The implementation uses the C-|— 1- language and the GDToolkit library 
(www. dia. uniromaS . it /people /gdb/wpl2). Experiments have been done with 
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a Sun Ultrasparc 1. The results of the experiments are summarized in the graph- 
ics of Fig. 0 To check the applicability of the algorithm we have measured the 
CPU time (Fig. 0a). To better understand the features of the test suite we have 
measured the number of embeddings (Fig. 0b) and the number of components 
affecting the computation time (Fig. 0c). 



1000 X ■ 1000000 




Fig. 5. Graphics summarizing the experiments: (a) CPU time (seconds), (b) 
number of embeddings (log. scale), and (c) number of components (sum of P 
and R nodes) . The a:-axis represents the number of vertices and in the y-axis we 
give average values. 



All the figures presented in this paper have been drawn with the GDToolkit 
system with an implementation of the algorithms of Sections 0 and 0 

7 Conclusions and Open Problems 

We have presented a new approach in constructing drawings of digraphs. Such 
approach can be considered as an equivalent of the popular topology-shape- 
metrics approach (that constructs orthogonal drawings of undirected graphs; 
see e.g. (El SO]) for drawing digraphs. In fact, the drawing process presented in 
this paper can be seen as a sequence of steps. During the first step a topology is 
found in terms of a bimodal planar embedding of the digraph. Dummy vertices 
representing crossings may be inserted. During the second step a shape is found 
with the minimum number of bends (within the given topology) in terms of 
an intermediate representation of the drawing. During the last step the final 
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drawing is constructed by using any of the technique for drawing planar st- 
digraphs (see 0) and, eventually, the bends are smoothed. Further, for the 
applications for which to have a tidy drawing it is really important, even at 
the expense of a higher computation time, we have shown a branch-and-bound 
technique that minimizes the number of bends of each biconnected component 
by searching all the possible topologies of the component. We have also shown 
that the algorithm has a reasonable time performance for digraphs up to 200 
vertices. 

This paper also opens several problems related to quasi-upward planarity. 
Is there a relationship between the minimum-feedback arc set problem and the 
minimization of the number of bends? How is such minimization related to the 
search of a maximum upward planar subgraph of a given digraph? 
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Abstract. In this paper, we study orthogonal graph drawings in three 
dimensions with nodes drawn as boxes. The algorithms that we present 
can be differentiated as resulting from three different approaches to creat- 
ing 3D-drawings; we call these approaches edge-lifting, half-edge-lifting, 
and three-phase-method. 

Let G be a graph with n vertices, m edges, and maximum degree A. We 
obtain a drawing of G in an n x n x A-grid where the surface area of 
the box of a node v is 0{deg{v)); this improves significantly on previous 
results. We also consider drawings with at most one node per grid-plane, 
and exhibit constructions in an n x n x m-grid and a lower bound of 
hence upper and lower bounds match for graphs with 0{v?) edges. 



1 Introduction 

In this paper, we study orthogonal drawings^ i.e., embeddings in the rectangular 
grid. We will mainly be concerned with drawings in dimension k = S, but need 
the terminology for dimension k = 2 as well. A grid-point is a point in whose 
coordinates are all integer. A grid-hox is the set of all grid-points (a;i, . . . , Xk) G 
satisfying x\ < Xi < xf for some integers x[,x'f, i = 1, . . . , fc. A port of a 
grid-box is a point of the box that is extremal in at least one direction. 

Throughout this paper, a kD orthogonal grid drawing of a graph G is a 
drawing that satisfies the following. Distinct nodes are represented by disjoint 
A:-dimensional grid-boxes. An edge e = (vi,V 2 ) is drawn as a simple path that 
follows grid-lines, possibly bending at grid-points; the endpoints of the path for 
e are ports on the boxes representing vi and V 2 - The intermediate points along 
the path for an edge do not belong to any node box. For A: = 3, the intermediate 
points along the path for an edge also do not belong to any other edge path; for 
k = 2, edge-paths may cross, but not touch or overlap. 

The volume of a 3D-drawing is the volume of the smallest grid-box containing 
the drawing. Often we refer to this bounding box as an A x T x Z-grid. In what 

* These results were part of a PhD thesis at Rutgers University under the supervision 
of Prof. Endre Boros, and done, in part, while the author was working at Tom Sawyer 
Software and funded by the NIST under grant number 70NANB5H1162. Patent on 
these and related results is pending. 
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follows, graph theoretic terms such as node are typically used to refer both to 
the graph theoretic object and to its representation in a drawing. 

The orthogonal drawing style has received much attention in the graph draw- 
ing community, for example 11 out of 43 papers at the last graph drawing con- 
ference 13 pertained to them. The orthogonal drawing algorithms split into two 
major classes. If the maximum degree of the input graph is bounded by twice 
the dimension, then every node can be drawn as a point (we speak of a point- 
drawings) . If the maximum degree exceeds this bounds, then one assigns a box to 
every node (we speak of a box-drawing) . For aesthetical reasons, this box should 
be small relative to the degree of the node. 

1.1 Existing Results 

The problem of 2D-orthogonal drawings has been studied extensively, both for 
point-drawings and for box-drawings. See niEnaEiiinniiiinnis! and the 
references therein. 

3D-orthogonal drawings have been studied almost exclusively for point-draw- 
ings, so for graphs with maximum degree 6 {6- graphs). Rosenberg gave an algo- 
rithm to embed any 6-graph in a grid of volume and showed that this is 

asymptotically optimal d. No bounds on the number of bends are given. Fades, 
Symvonis and Whitesides proposed drawings in an 0{^/n) x 0(^/n) x 0{^/r^)- 
grid with at most 7 bends per edge (S|- They also gave a construction in a 
3n X 3n X 3n-grid with 3 bends per edge. This was later improved by Papakostas 
and Tollis to a grid of volume at most 4.66n^ with 3 bends per edge [ 1 3j . 

For 3D-orthogonal box-drawings, only two results are known to the author. 
Papakostas and Tollis presented an algorithm to embed any graph in a grid of 
volume 0{m?) with at most 2 bends per edge The author, together with 
Shermer, Whitesides and Wismath, studied how to embed the complete graph 
Kn, and established lower bounds 0. 

1.2 Our Results 

In this paper, we review old and present new algorithms for 3D-orthogonal box- 
drawings. These algorithms fit into three very different approaches to creating 
an orthogonal drawing. Two of these approaches have been used PIH] without 
being defined abstractly. 

The first approach, which we call edge-lifting, yields drawings with excellent 
volume-bounds, but nodes may be disproportionally large. The second approach, 
which we call half-edge-lifting, yields drawings in which nodes are proportional to 
the degree of the node, i.e., they are in what we call the degree-restricted model. 
This approach makes it possible to convert many two-dimensional orthogonal 
graph drawings into three-dimensional ones. The third approach is called three- 
phase method, because it mirrors the three-phase method introduced for 2D- 
orthogonal drawings in |2j . 

The second and third approach result in new drawing algorithms with im- 
proved volume bounds. In particular, we improve the volume bound of 0{mf) 
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m to 0{n^), while maintaining the property that the surface area of each node 
is proportional to the degree of the node. We also construct drawings in which 
the nodes are represented by cubes, at a slight increase of the volume to 0{'n?m). 
To our knowledge, this is the first algorithm that draws nodes as cubes. 

The drawings created with the first two approaches can be considered two- 
dimensional in spirit, because they are created by starting with a 2D-orthogonal 
drawing and lifting it into 3D. The third approach works differently, by placing 
nodes directly in three-dimensional space. This enables us to study drawings with 
at most one node per grid-plane. We exhibit constructions that achieve a volume 
of O(n^m), and a volume of 0{y/nrn ), respectively; the latter construction again 
represents nodes as cubes. We also study lower bounds, and prove that no such 
drawing could have less than 0(max{n^, m^}) volume, thus the smaller of our 
constructions is optimal for graphs with 9{n^) edges. 

2 Preliminaries 

In the following, we clarify some terminology used for 3D-drawings. Recall that 
a grid-box in dimension k is the set of all grid-points (a: i, . . . , Xk) S satisfying 
x\ < Xi < xf for some integers x\, x^, f = 1, . . . , fcjjA grid-box is said to have 
width w = X I — x\ 1 and height /i = X 2 — -I- 1; thus we measure the number 

of grid-points across, not the distance between the first and the last grid-point. 
In 3D, we also use the terms depth d = x"^ — x^^ 1, size w x d x h, and volume 

whd. The volume is thus the number of grid-points contained in the grid-box, 
and can never be 0. 

A port of a grid-box is a grid-point in the box that is extremal in at least one 
direction. Ports are classified by their direction of extremity as -|-a;-ports, —x- 
ports, -|-y-ports, etc. An x-port is a port that is either a -l-x-port or a — x-port; 
j/-ports and z-ports are defined similarly. Points that are extremal in more than 
one direction are counted repeatedly as ports. The total number of ports of a 
box is called the surface area of the box; for a box with width w, height h and 
depth d the surface area is thus 2{wh -\-hd-\- wd). 

A z-line is a grid-line that is parallel to the z-axis. A z-plane is a grid- 
plane that is orthogonal to the z-axis. The coordinate of a z-plane is the fixed 
z-coordinate. A z-segment is a segment along a z-line. The terms are defined 
similarly for x and y. A grid-plane is an a;-plane, y-plane or z-plane with integer 
fixed coordinate. 

Let G = (V,E) be a graph, \V\ = n, \E\ = m. Denote by A the maximum 
degree of G. Throughout this paper, we assume that G is connected (any two 
nodes are connected by a path) and simple (no loops and multiple edges), and 
has no nodes of degree 1; we call such a graph normalized. 

^ This paper allows degenerate boxes, i.e., boxes that have dimension 1 with respect 
to one or more coordinate directions. Such degenerateness can be removed by adding 
additional grid-lines, which increases the volume of the drawing by a multiplicative 
constant. 
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2.1 Models for Three-Dimensional Drawings 

In the definition of an orthogonal drawing there are no restrictions on the size 
of node-boxes. However, typically one wants nodes resembling points, therefore 
their boxes should be approximately squares respectively cubes. 

To achieve such drawings, a number of models have been introduced for 2D 
box-drawings. The Unlimited- growth morfeZ imposes no restrictions on the dimen- 
sions of the nodes. The Proportional- growth model demands that the dimensions 
of a node may be only as big as needed for the number of incident edges. The 
Kandinsky-model imposes special conditions on nodes that are placed in the 
same horizontal or vertical range. See for details. 

The Unlimited-growth model transfers directly to 3D. The Kandinsky- model 
can likewise be transferred, but this will not be explored in this paper. There is 
no direct equivalent of the Proportional-growth- model in 3D . We could demand 
that the surface area on each side of the node is only as large as needed for the 
number of incident edges, but this may lead to problems because the number 
of incident edges on the sides may not admit a suitable factorization. Instead, 
we use the Degree-restricted model or dr-model: Informally, a drawing is in the 
degree-restricted model if the surface area of node v is 0{deg{v)) for all nodes 
V. More precisely, a drawing is in the (ci,C 2 )-dr model if for the surface area 
of a node v is at most c\deg(v) -I- C 2 . A drawing is said to be in the degree- 
restricted model if it is in the (ci, C 2 )-dr model for some constants ci, C 2 that are 
independent of the input graph. 

The previous heuristics for 3D-orthogonal drawings worked in the degree- 
restricted model: The algorithm by Papakostas and Tollis yields drawings in the 
(6,0)-dr model Ol . and the drawing of AT„ in an ^ x ^ x ^-grid in ^ is in the 
(2,4)-dr model. Also, every 2D-drawing in the proportional-growth model is in 
the (2, 2)-dr model. 

Drawings in the degree-restricted model may still be unsatisfactory, for ex- 
ample, a drawing where node v has a 1 x 1 x )-box is in the degree-restricted 
model, but the elongated boxes would be considered unpleasant by many users. 
Therefore, we propose another, more stringent model, which we call cube-model: 
The box of v must be a cube whose surface area is proportional to the degree 
of V. Similarly one could define the square-model for 2D-orthogonal drawings, 
but to our knowledge this has not been used. For an algorithm where the aspect 
ratio of node boxes is at most 2, see | 2 |. 

3 Three Approaches to 3D-Orthogonal Box-Drawings 

3.1 Approach I: Lifting Edges 

In this section, we present the first approach to 3D-orthogonal drawings, lifting 
edges^ which has been used in 

The idea is to start with a 2D-orthogonal drawing T which is semi-valid] by 
that we mean that no nodes overlap and no edge crosses a node, but edges may 
overlap each other. Nodes may be boxes or points. 
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Split this drawing F into a number of drawings Fi, . . . , Fg, such that each 
drawing Fi is a valid orthogonal drawing. Here, splitting a drawing means to find 
a partition EiU . . .U Eg of the edges; Fi is then the restriction of F to all nodes 
and the edges in Ei. See also Figured where we split a semi- valid drawing of 
Kg, into four valid drawings. This is similar to the construction used in d- 




Fig. 1. We split a semi-valid drawing of Kg into four crossing-free drawings. 



Now place these crossing-free drawings Fi,...,Fg into 9 z-planes Pi,. .. ,Pg. 
Extend every node to intersect all z-planes P\, . . . ,Pg to obtain an orthogonal 
box-drawing F' , which has the same height and width as F, and depth 6. 






z 




X 




Fig. 2. Finishing the drawing started in Figured 



Using this method, drawings with excellent volume-bounds can be obtained. 
The first theorem results from the construction explained in Figured andd For 
improvements, and the proof of the second theorem, we refer to d- 

Theorem 1. ^ Every simple graph with n nodes, n even, has a box-drawing 
in an n X n X ^-grid with 1 bend per edge. 

Theorem 2. m Every simple graph with n nodes, n = N'^ a perfect square, has 
a box-drawing in a 2N x 2N x ^N^-grid with 3 bends per edge. 

The main disadvantage of this approach is the size of the nodes. In the 
first construction, node v has a 1 x 1 x ^-box, which is in the degree-restricted 
model for the complete graph, but not for an arbitrary graph. In the second 
construction, node v has a 1 x I x |n^ ®-box, which is not in the degree-restricted 
model. 

Thus, even though this approach yields very small volume (in fact, the second 
construction matches asymptotically the lower bound d)i use is of rather 
theoretical nature to explore smallest-possible upper bounds. 
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3.2 Approach II: Lifting Half-Edges 

In this section, we introduce a second approach to 3D-orthogonal box-drawings 
which we call lifting half-edges. Similar as in the previous approach, it starts 
with a 2D-orthogonal drawing and converts it into a 3D-orthogonal drawing. 
As opposed to the previous approach, it does so in a way that ensures that the 
drawings are in the degree-restricted model. 

Again assume that we are given a semi- valid 2D-orthogonal drawing F. Split 
r into two drawings Fh and Fy, where Fh contains all horizontal edge-segments, 
while Fy contains all vertical edge-segments. Notice that Fh and Fy may have 
overlap, but they have no crossing. See Figure 0 




Fig. 3. We split a semi- valid drawing F of Ag into Fh and Fy. 



Then split Fy into drawings that have no overlap, and split Fh into 9h 
drawings that have no overlap. As in the first approach, place each obtained 
drawing in z-plane of its own, and extend the nodes through all z-planes that 
contain an incident edge. At every bend of an edge in F, add a z-segment that 
connects the two endpoints of the horizontal and the vertical segment incident 
to this bend. Thus, if an edge had k bends in F, then it has 2k bends in the 
resulting three-dimensional drawing F' . 




Fig. 4. Split Fh and Fy into four drawings each, extend nodes, and add z- 
segments. We show only a selected subset of the added z-segments. 



The advantage of this approach lies in the fact that Fh and Fy have no 
crossings, and hence conflicts can be resolved much easier. However, care has to 
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be taken which edge segment is assigned to which drawing, because the added 
z-segments must not cross. This is possible for a large class of input-drawings T. 

We say that a semi-valid 2D-orthogonal drawing is in general position if 
no grid-line intersects more than one node. In particular, in a 2D-orthogonal 
drawing in general position, every edge has at least one bend, and no more than 
one bend per edge is needed 0 . For any such drawing, we can apply the half-edge 
lifting technique. Details are omitted. 

Theorem 3. Let F he a semi-valid 2D-orthogonal drawing in general position 
with exactly one hend per edge. Assume that F uses a w x h-grid and at most 6 
edges overlap in any given place. Then there exists a 3D-orthogonal drawing in 
a w X h X d-grid, where d < 29. The drawing is in the degree-restricted model. 

This theorem has far-reaching implications. For example, we can use it to 
generalize the interactive drawing results in |2|, the results on orthogonal draw- 
ings with small area 0 , and the results on incremental drawings with small area 
P], because these drawings are in general position with one bend per edge. 

In particular, using the semi- valid drawing used to create the 2D-drawing in 
an X 2i^-grid in 0, we obtain the following results. 

Corollary 1. Every normalized graph has a 3D-orthogonal box-drawing in an 
nxnxA-grid with 2 bends per edge. Node v is contained in a 1 x lx (deg(z;)/2-|-l)- 
grid-hox, so the drawing is in the (2,6)-dr model. 

This drawing improves on the result in [El with respect to the volume {0{n^) 
vs. 0{m^)), and with respect to the surface area of nodes ((2,6)-dr model vs. 
(6,0)-dr model). (The drawings in forbid degenerate node-boxes, but our 
results improve the volume even if we double all grid-dimensions to achieve non- 
degenerate node-boxes.) 

One might criticize in the above construction that the nodes are highly de- 
generate in that they extend only in one direction. With a slightly different 
construction, the nodes become cubes, at the cost of an increase in volume. 

Corollary 2. Every normalized graph has a 3D-orthogonal box-drawing in a 
{n-\-2y/nm) x {n-\-2^/nrn) x -grid with 2 bends per edge. Node v is contained 
in a cube of side-length 2\ ^ deg{v) / 2~\ , so the drawing is in the cube-model. 

The main criticism of drawings created with the approach of lifting half- 
edges is that they are essentially two-dimensional. When looking at the drawing 
from the top, we see the input-drawing F. Moreover, a z-plane that is between 
the z-plane with the largest coordinate used for Fh and the z-plane with the 
smallest coordinate used for T„ intersects all edges and all nodes that have 
incident horizontal and vertical segments. Borrowing a term from computational 
geometry, one could call these drawings 2i-dimensional. Whether such a drawing 
is advantageous or disadvantageous is debatable, but it is a puzzling question 
whether smaller drawings could be achieved by truly making use of the third 
dimension. 
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3.3 The Three-Phase Method 

In this section, we explain a third approach to 3D-orthogonal drawings, which 
imitates the three-phase method for 2D 

In the first phase, node placement, nodes are drawn as points, not as boxes. 
In the second phase, edge routing, we assign bends to every edge. We continue 
to draw nodes as points, hence edges may overlap. In the third phase, port 
assignment, we replace each grid-plane by many grid-planes, and re-assign edges 
to ports of node-boxes such that all overlaps and crossings are removed. 

The crux of the three-phase method is to find node placement and edge 
routing schemes that permit port assignment. For the three-phase method in 
2D, a number of sufficient conditions have been found P|. Unfortunately, they 
do not transfer easily to 3D, because we have to ensure additionally in 3D that 
there are no crossings between edges. We have found a set of sufficient conditions 
that ensure a crossing-free drawing. We state these conditions here, and explain 
the terms, and how to find the port assignment in the next few sections. 

Condition 1 We are given a node placement in line-free position and an edge 
routing using cube-routes. No two nodes coincide. No edge overlaps a node. Two 
edges may cross only it the two crossing segments attach to endpoints of the edges. 
Two edges may overlap only if the overlapping segments attach to a common 
endpoint of the two edges. 

Node Placement A node placement in a 3D-grid is said to be in line-free 
position if every grid-line intersects at most one node; it is said to be in xy- 
general position if every a;-plane and every y-plane intersects at most one node, 
and to be in general position if every grid-plane intersects at most one node. The 
terms are defined similarly for an orthogonal drawing. 

Any node placement in general position is also in ly-general position, and any 
node placement in xy-general position is also in line-free position. We can show 
the feasibility of port assignment (given that the other conditions are satisfied) 
for any node placement in line-free position. However, we managed to find an 
edge routing such that the other conditions are satisfied only in the case of a 
node placement in xy-geneval position or in general position. 



Edge Routing In the three-phase method in 2D, each edge is routed with 
at most one bend; therefore there are at most two possible edge routings. In 
3D, we allow two bends per edge. It is possible to draw each edge with one 
bend in the unlimited growth model 0, but we know of no such results in the 
degree-restricted model. 

Allowing two bends per edge implies six possibilities of routing an edge, 
using the edges of the cube spanned by the two endpoints; we call these routes 
cube-routes. The three segments of an edge are called x-segment, y-segment and 
z-segment. If the two endpoints of an edge have one coordinate in common, then 
the cube degenerates to a rectangle. In this case, we place two bends at the 
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same place (these will be expanded during port assignment). Note that the two 
endpoints cannot have two coordinates in common if the nodes are in line-free 
position. 

We use three subclasses of these cube routes to ensure Condition Q 

— directed z-routes: Directed z-routes are those cube-routes for which the mid- 
dle segment is the z-segment. We associate the two z-routes of an edge 
e = (v,w) with a direction of e; thus e = v ^ w corresponds to the route 
that uses the x-line of v and the y-line of w. 

— color-routes: Eades, Symvonis and Whitesides 0 gave an algorithm for 3D 
point-drawings which uses cube-routes. They restricted their attention to 
three out of the six routes, and associated them with colors. We call these 
routes the color-routes. 

— shortest-middle route: The length of the a;-segment, y-segment and z-segment 
is determined by the position of the endpoints. An edge is said to be routed 
using the shortest-middle route if the middle segment is the shortest segment, 
breaking ties arbitrarily. 




Fig. 5. The directed z-routes, the color-routes, and the shortest-middle routes. 



The following lemmas are proved by straightforward case-analysis considering 
the grid-plane that contains an overlap or a crossing. 

Lemma 1. Let the node placement be in xy-general position, and let the edge 
routing be done with directed z-routes. Then Condition^ is satisfied. 

Lemma 2. Let the node placement be in general position, and let the edge rout- 
ing be done with color-routes. Then Conditionals satisfied. 

Lemma 3. Let the node placement be in general position, and let the edge rout- 
ing be done with shortest-middle-routes. Then Conditionals satisfied. 

To analyze the edge routing, we introduce the following notation: For node v, 
define Aj,{v) = ma.x{ff edges attaching on the — a;-side of v, # edges attaching 
on the -l-x-side f}; thus A^^v) is the number of x-ports needed at v. Similarly 
we define Ay{v) and Az{v). 

One can show that for any node placement there exists an edge routing using 
z-routes such that Ax{v),Ay{v),Az{v) < \deg{v)/2~\ (see Lemma 3 in (3|). We 
leave as an open problem to find an edge routing that satisfies Condition 1 such 
that the bounds are roughly Ax{v) , Ay{v) , Az{v) < \deg{v)/3~\. 
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Port Assignment In this section, we prove that port assignment is feasible if 
Condition Q] is satisfied. 



Lemma 4. Assume that we are given a node placement and edge routing that 
satisfies Condition 0 Then we can assign ports such that there is neither a 
crossing nor overlap. 



Proof. For every node v, choose arbitrary numbers Xy(v), Xz{v),yx{v),yz{v), 
Zx{v),Zy{v) such that Xz{v)yz{v) > Az{v), Xy(v)zx(v) > Ay(v) and yx{v)zy{v) > 
Aa,(z;); and furthermore a;y(u)-|-a;z(u) > l.,yx{v)+yz{v) > land Zx{v)+Zy{v) > 1. 
Good choices will be discussed later. 

For every x-plane Px after node placement, we add max„g {xy (u)} x-planes 
after Px and m.axy^p^{xz{v) — 1} x-planes before Px. For every j/-plane Py af- 
ter node placement, we add max„gp^{?/a;(u)} j/-planes after Py and max^jgp^ 
{yz{v) — 1} y-planes before Py. For every z-plane Pz after node placement, we 
add max„gp^{za;(u)} z-planes after Pz and max„gp^{zy(u) — 1} z-planes before 
Pz. Here, “after” and “before” are taken with respect to the coordinate system. 

Assume that node v was placed in the grid-planes Px, Py, Pz. Then we assign 
to V the grid-box that is the intersection of Xz{v) + Xy{v) x-planes, yz{v) + yx{v) 
y-planes, and Zy{v) + Zx{v) z-planes; using the grid-planes before Px, Py, Pz, then 
Px,Py,Pz, and then the grid-planes after Px,Py,Pz. If two nodes were placed 
at different grid-points during node-placement, then their boxes are disjoint, 
because we added sufficiently many grid-planes around the point of each node. 




Fig. 6. We replace the point of v by the box that spans Xy + Xz x-planes, yx + yz 
y-planes and Zx + Zy z-planes. In this example, Xy = 2, Xz = 2, yx = 2, yz = 3, 
Zx = 3 and Zy = 2; we show the added planes dashed. 



Of the Zx(v) + Zy{v) z-planes for node v, the Zx{v) z-planes above Pz will 
be used to route edges that attach to v with an x-segment, while Pz and the 
Zyiy) — 1 z-planes below it will be used to route edges that attach to v with a 
y-segment. With this technique and by Condition 1, all crossings that occurred 
after edge routing are removed. See Figure 0 for an illustration of which ports 
are actually used at a node. By Xz{v)yz{v) > Az{v), Xy(v)zx(v) > Ay(v) and 
yx{v)zy{v) > Ax{v) there are sufficiently many z-ports, y-ports and x-ports, 
respectively. 
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Now we explain how to assign ports to edges attaching to the — y-side of v 
such that all overlaps are removed and no new crossings are introduced; port 
assignment for the other sides of v is done similarly. We group the edges attaching 
at the — y-side of v into four groups, depending on their direction of continuation. 
We assign sufficiently many — y-ports to each group such that no edges of two 
different groups could possibly cross. Then we sort the edges in each group by 
decreasing y-coordinate of the next bend, and assign them to a port of their 
group such that no two edges within one group cross. Details are omitted, see 
Figure 0 for some explanatory illustrations. 



z 





Fig. 7. On the left, we show the overall appearance of the node v, and indicate 
the used ports in black. On the middle and the right, we sketch the port assign- 
ment procedure: split the ports into four groups, and assign edges to ports in an 
appropriate order. 

Lemma 5. With a suitable choice ofxy{v),Xz{v),yJv),yz{v),Zx{v), and Zy{v), 
the surface area of node v is at most 12deg(t)) -I- 24q 

Proof. Fix a node v and set Ox = max{^ 2 ;('(;), 1}, Oy = m.ax.{Ay{v) ,1} , Uz = 
max{^ 2 (ti), 1}. After possible renaming of coordinates, we may assume Ox < 
o-y < Oz- We will find integers x,y,z with xy > Oz, xz > Oy, and zy > Qx, and 
set Xy{v) = Xz(v) = x> 1, yz(v) = yz(v) = y > 1, and Zx(v) = Zy(v) = z > 1. 

If ttz < QxOy choose X = \^/ayaz/ax],y = \^/axaz/ay~\,z = \^/axay/az']■ 
One can show that then the surface area of v is 4,{xy+yz+zx) < 12(az + ay + az). 

If on the other hand Oz > axOy, define z = l,y = Ox and x = \az/ax'\ > Oy. 
By xy + yz + zx < Ox + (ozlo-x + 1) + <Xx(azlo-x + 1) < 2(aa, -I- Oz) + 1, the surface 
area of v is at most 8(aa, -I- a^). 

By deg{v) > 2, we have Ox + ay + Oz < Ax{v) + Ay{v) + Az{v) + 2 < deg{v) + 2, 
which finishes the proof. 

^ Our focus was on that the drawing is in the degree-restricted model, not on giving 
the smallest constants possible; therefore we chose the parameters for convenience 
of giving a simpler proof. 
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Results The main criticism of Approach II was that the resulting drawings are 
2i-dimensional, in that there exists a z-plane crossing all edges and nodes. Cre- 
ating truly 3D-drawings is straightforward using the three-phase method. Place 
the nodes arbitrarily in general position, route the edges such that Condition 1 
is satisfied (for example using color-routes), and then apply port assignment. 

Using the node placement of |^, but assigning each node arbitrarily to a 
different z-plane, and routing the edges using directed z-routes, one obtains the 
following theorems. 

Theorem 4. Every normalized graph has a 3D-orthogonal box-drawing in gen- 
eral position in an n X n X m-grid with 2 bends per edge. Node v is contained in 
a 1 X 1 X (deg(u)/2 -|- l)-grid-box, so the drawing is in the (2, 6) -dr model. 

Theorem 5. Every normalized graph has a 3D-orthogonal box-drawing in gen- 
eral position in a grid of side-length n -\- 2^/nrn with 2 bends per edge. Node 
V is contained in a cube of side-length 2[y^deg(u)/2] , so the drawing is in the 
cube-model. 

However, these theorems are somewhat unsatisfactory, in that even for the 
smaller drawing we have a significant increase in the volume, from 0{n^) of 
Section ITTI to 0{n^m). So the question arises whether there exists a truly 3D- 
drawing with volume 0{n^). Note that the term truly 3D-drawing is not precisely 
defined. One could define it as a drawing with o(n) nodes per grid-plane, or as a 
drawing with 0{f{n)) nodes per grid-plane, where one could argue the case for 
any of the functions f{n) = y/n, f{n) = and f{n) = 1. 

Our results are in this last and most stringent definition of a truly-3D draw- 
ing, which means drawings in general position. We can show that under these 
conditions, no drawing of volume better than 0{mf) is possible, thus we are 
optimal for graphs with m = 9{n^). 

Theorem 6. Any 3D-drawing in general position has volume I7(max{n^, m^}). 

Proof. Assume that we have a drawing with at most one node per grid-plane. 
Number the nodes as and denote by Xi,yi, Zi and di the dimensions and 

the degree of node i. Because no grid-plane intersects two nodes, the drawing 
must have width >'^Xi, depth >'^yi and height > '^ Zi (all summations are 
from 1 to n). Furthermore, we have 2{xiyi -\- yiZi -\- Zixf) > di and ^di = 2m. 

By Xi,yi,Zi > 1, the lower bound of J7(n^) follows immediately. For the 
lower bound of J7(m^), let df = min{di/6, df = mm{di/6, XiZi}, and 

df = min{di/6, Xij/i}. In at least one of the three directions, the minimum must 
be attained at di/6, therefore + ^i) — (after 

possible renaming of the coordinate directions) 'Yhdl > \m. 

By df < di < n we have dl/^/n < y/df. Therefore ^ > ^ml^/n, and 

by the Cauchy-Schwartz inequality, which yields the result. 

We conjecture that this lower bound can be improved to Q{n^m). 
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4 Conclusion and Open Problems 

In this paper, we studied three-dimensional orthogonal drawings of graphs of 
arbitrarily high degrees. We presented three approaches, and obtained, among 
others, the following results: 

Every normalized graph has a drawing in an n x n x A- grid with 2 bends per 
edge. Thus, the resulting volume is 0{n^), which for the complete graph is a 
square-root factor better than the result of 0{m?) of Papakostas and Tollis ^3|- 
Also, this result matches the 0{n^) construction for the complete graph but 
is in the degree-restricted model for all graphs, not only graphs with degrees in 
6»(n). 

This result falls short of the lower bound of and the construction 

with volume presented in However, the latter construction is not in 

the degree-restricted model, not even for the complete graph. 

Open Problem: In the degree-restricted model, is there a drawing of 
volume or can the lower bound be raised to 0{n^)2 

Every normalized graph has an {n + 2^Jnm) x (n + 2y^nm) x [’s/A] -drawing with 
two bends per edge in the cube-model. This is, to our knowledge, the first result 
in the cube model that also maintains a small surface area of the cube, and a 
small overall volume. However, this result comes at an increase in the volume to 
0{n^m). 

Open Problem: Is there a drawing in the cube-model with volume 0{n^)l 

Every normalized graph G has an nxnx m-drawing in general position with two 
bends per edge. This result does not match the lower bound of I7(max{n^, m^}), 
but we conjecture that no drawing can achieve an asymptotically smaller volume. 

Open Problem: For drawings in general position, is there a drawing of 
volume 0{max{n^ or can the lower bound be raised to 0{n^m)l 

Also, while having all nodes in one place is too two-dimensional, having each 
node in a separate grid-plane seems a waste. What is good middle ground? Do 
there exist small drawings with nodes in every grid-plane? What volume- 

bounds can be achieved? 
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Abstract. We are concerned with the problem of visualizing intercon- 
nections in railroad systems. The real-world systems we have to deal with 
contain connections of thousands of trains. To visualize such a system 
from a given set of time tables a so-called train graph is used. It contains 
a vertex for each station met by any train, and one edge between every 
pair of vertices connected by some train running from one station to the 
other without halting in between. 

In visualizations of train graphs, positions of vertices are predetermined, 
since each station has a given geographical location. If all edges are rep- 
resented by straight-lines, the result is visual clutter with many overlaps 
and small angles between pairs of lines. We here present a non-uniform 
approach using different representations for edges of distinct meaning 
in the exploration of the data. Only edges of certain type are repre- 
sented by straight-lines, whereas so-called transitive edges are rendered 
using Bezier curves. The layout problem then consists of placing control 
points for these curves. We transform it into a graph layout problem and 
exploit the generality of random field layout models for its solution. 



1 Introduction 

The layout problem we are concerned with arises from a cooperation with a 
subsidiary of the Deutsche Bahn AG (the central German train and railroad 
company), TLC/EVA. The aim of this cooperation is to develop data reduction 
and visualization techniques for the explorative analysis of large amounts of time 
table data from European public transport systems. For the most part, these are 
comprised of train schedules. However, the data may also contain bus, ferry and 
footwalk connections. The analysis of the data with respect to completeness, 
consistency, changes between consecutive periods of schedule validity, and so on 
is relevant, e.g., for quality control, (international) coordination, and pricing. 

To condense the data, a train graph is built in the following way: For each 
regular stop of any train, a vertex is added to the network. One arc is added, 
if there is service from one station to another without intermediate stops. For 
convenience, we assume that for each train operating between two stations, there 
is a corresponding train serving the opposite direction. Hence, the train graphs 
considered here are simple and undirected. 
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An important aspect is the classification of edges in two categories: min- 
imal edges and transitive edges. Minimal edges are those corresponding to a 
set of continuous connections between two stations not passing through a third 
one. Typically, these are induced by regional trains stopping at every station. 
On the other hand, transitive edges correspond to connections passing through 
other stations without halting. These are induced by through-trains. Figure 1(a) 



shows part of a train graph with edges colored according to this classification. 
Stations are positioned according to their geographical location, and all edges are 
represented straight-line. An obvious problem are edge overlaps and small angles 
between edges. In order to maintain geographic familiarity, we are not allowed to 
move vertices. Since minimal edges usually represent actual railways, they also 
remain the same, but we refrain from drawing all transitive edges straight-line. 
Instead, we use Bezier curves as shown in Fig. |l(b)[ 




(a) straight lines 



(b) Bezier curves 



Fig. 1. Two different representations of transitive edges in a small train graph 



To render Bezier curves, control points need to be positioned. Using the 
framework of random field layout models introduced in 0, the problem is cast 
into a graph layout problem. More precisely, we consider control points to be 
vertices of a graph, and rules for appropriate positioning are modeled by defining 
edges accordingly. This way, common algorithmic approaches can be employed. 
Practical applicability of our approach is gained from experimental validation. 
In a completely different field of application, the same strategy is used to iden- 
tify suitable layout models for social and policy networks jSj. These real-world 
applications are good examples of how the uniform approach of random field 
layout models may be used to obtain initial models for visualization problems 
which are not clearly defined beforehand. 

The paper is organized as follows. In Sect. 0 we review briefly the concept of 
random field layout models. A specific random field model for train graph layout 
is defined in Sect. 0 Section 0 contains experiments with real-world examples 
and a short discussion on aspects of parametrization. 
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2 Random Field Models 

In this section we review briefly the uniform graph layout formalism introduced 
in . As can be seen from Section 0 model definition within this framework is 
straightforward. 

Virtually every graph layout problem can be viewed as a constrained opti- 
mization problem. A layout of a graph G = (V, E) is computed by assigning 
values to certain layout variables, subject to constraints and an objective func- 
tion. Straight-line embeddings, for example, are completely determined by an 
assignment of coordinates to each vertex. But straight-line representations are 
only a very special case of a layout problem. In its most general form, each el- 
ement of a set L — {li, . . . ,lk} of arbitrary layout elements is assigned a value 
from a set of allowable values Xi, I £ L. Layout elements may represent posi- 
tional variables for vertices, edges, labels, and any other kind of graphical object. 
Therefore, L and X = X^ = x • • • x Xi^, are clearly dependent on the chosen 
type of graphical representation. In this application, we do not constrain con- 
figurations of layout elements. Hence, all vectors x ^ X are considered feasible 
layouts. 

Objective Function. In order to measure the quality of a layout, an objective 
function U : A — > IR is defined. It is based on configurations of subsets of layout 
elements which mutually influence their positioning. This interaction of layout 
elements is modeled by an interaction graph = (L, E^) that is obtained from a 
neighborhood system rj = U/gl Vh where rji C L\{1} is the set of layout elements 
for which the position assigned to I is relevant in terms of layout quality. These 
interactions are symmetric, i.e. I 2 G ^ h G rji^ for all h,l 2 G L, so G^ 
is undirected. The set of cliques in G'' is denoted hy C — C{rj). We define the 
interaction potential of a clique G € C to be any function Uc : A — > IR for which 

xc = yc Uc{x) = Uc{y) 

holds for all x,y G X., where xc = A graph layout objective function 

[/ : A ^ IR is the sum of all interaction potentials, i.e. U{x) = J2ceC ^c{x). By 
convention, the objective function is to be minimized. U{x) is often called the 
energy of x. 

Fundamental Potentials. One advantage of separating the energy function into 
clique potentials is that recurrent design principles can be isolated to form a tool- 
box of fundamental potentials. Not surprisingly, the two most basic potentials 
are those corresponding to the forces used in the spring embedder 0Q 

— Repelling Potential: The rule that two layout elements k and I should not 
lie close to each other can be expressed by a potential 

<,'’(*) = 

^ The original spring embedder does not specify an objective function, but its gradi- 
ents. The above potentials appear in [S|. 
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where is a fixed constant and d{xk,xi) is the Euclidean distance between 
the positions of k and 1. Rep{xk,xi \ g) is used to indicate a specific choice 
of g. 

— Attracting Potential: If, in contrast, k and I should lie close to each other, a 
potential 

^ = a ■ d{xk,xif, 

with a a fixed constant, is appropriate. Like above we use Attr{xk, xi \ a) to 
denote a specific choice of a. 

Since Rep{xk,xi \ X^) + Attr{xk,xi \ 1) is minimized when d{xk,xi) = A, it is easy 
to specify a desired distance between two layout elements (e.g. edge length). Note 
that many other design rules (sufficiently large angles, vertex-edge distance, edge 
crossings, etc.) are easily formulated in terms of clique potentials [^. 

If layouts X G X are assigned probabilities 

P(X = x) = 

where Z = is a normalizing constant, random variable X is a 

(Gibbs) random field. Both X and its distribution are called a (random field) 
layout model for G. Clearly, the above probabilities depend on the energy only, 
with a layout of low energy being more likely than a layout of high energy. By 
using a random variable, the entire layout model is described in a single object. 
Due to the familiar form of its distribution, a wealth of theory becomes applicable 
(a primer in the context of dynamic graph layout is 0). See CH for an overview 
on the theory of random fields, and some of its applications in image processing. 
Since random fields are used so widely, there also is a great deal of literature on 
algorithms for energy minimization (see e.g. [liUjl. 



3 Layout Model 

We now define a random field model for the layout of a train graph G = (V, E). 
Vertex positions are given by geographical locations of corresponding stations, 
and minimal edges as well as very long transitive edges are represented straight- 
line. For the other edges we use Bezier cubic curves (cf. Fig. EjS Let E C E 
be the set of transitive edges of length less than a threshold parameter ti , such 
that the set of layout elements consists of two control points for each edge in E, 
L — {bu{e),by{e) \ e = {u, u} S E}. If two Bezier points belong to the same edge, 
they are called partners. The anchor, ab^(e), of 6„(e) is u, while the anchor of 
by(e) is V. The default position of all Bezier points is on the straight line through 
the endpoints of their edges at equal distance from their anchor and from their 
partner (and hence uniquely defined). 

^ It will be obvious from the examples presented in Section 0 why it is not useful to 
represent all transitive edges by Bezier curves. 
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Fig. 2. Bezier cubic curve 0. Two endpoints and two control points define a 
smooth curve that is entirely enclosed by the convex hull of these four points 



The position assigned to a Bezier point is influenced by its partner, its anchor, 
all Bezier points with the same anchor, and a number of close stations and Bezier 
points. Let {u,v} G if be a transitive edge, and let 6 G L be a Bezier point of 
{u,v}. Given two parameters ei and C 2 , consider an ellipse with major axis 
going through u and v. Let its radii be ei • ^nd €2 ■ respectively. 

We denote the set of all stations and Bezier points (at their default position) 
within this ellipse, except for b itself, by £b- Recall that the neighborhood of some 
layout element consists of all those layout elements that have an influence on 
its positioning. Therefore, rjb equals the union of £b H L, the set of Bezier points 
with the same anchor as b, and (since interactions need to be symmetric) the set 
of Bezier points b' for which 5 G £b'- For the examples presented in Section 0 we 
used ei = 1.1 and 62 = 0.5. 

An interaction potential is defined for each design goal that a good layout of 
Bezier points should achieve: 

— Distance to stations. For each Bezier point b G L of some edge {u, v} G E, 
there are repelling potentials 

Rep{xb,s\{gi ■ Xb)^) , 

seSbOV 



with \b = and qi a constant. These ensure reasonable distance from 

stations in the vicinity of b and can be controlled via gi. A combined repelling 
and attracting potential 

Rep (xb, Ub I (Ai • A{,)'‘) + Attr{xb,ab), 

where A is another constant, keeps b sufficiently close to its anchor as- 
— Distance to near Bezier points. As is the case with near stations, a Bezier 
point bi G L should not lie too close to another Bezier point 62 G rjb^ . If 61 
is neither the partner of nor bound to 62 (binding is defined below), we add 

Rep (xhi , Xb^ I £<2 • min{Abj , A^^ }) . 
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The desired distance between partners b\ and 62 is equal to the desired 
distance from their respective anchors, 

Rep {xb^,Xb2 I (Ai • AhJ^) + Attr{xbi,Xb2)- 

— Binding. In general, it is not desirable to have Bezier points 5 i, 62 € L with 
a common anchor lie on different sides of a minimal edge path through the 
anchor. Therefore, we bind them together, if Xb^ does not differ much from 
Ahj, i.e. if ^ < ^2 for a threshold T2 > 1, we add potentials 

13 ■ {Rep{xbi , Xb2 I A 2 • (Afcj + A 5 J/ 2 ) + Attr{xb^ , Xb^)) , 

where A2 is a stretch factor for the length of binding edges, and /3 controls 
the importance of binding relative to the other potentials. 

In summary, the objective function is made of nothing but attracting and re- 
pelling potentials that define a graph layout problem in the following way: 
Stations correspond to vertices with fixed positions, while Bezier points cor- 
respond to vertices to be positioned. Edges of different length exist between 
Bezier points and their anchors, between partners, and between Bezier points 
bound together. Just like edge lengths, repulsion differs across the elements. See 
Fig. Eland recall that repelling forces act only locally (inside of neighborhoods). 
Let 9 = (gii, g2, Ai, A2, / 3 , Ti, T2) denote the vector of parameters. The effects of 
its components are summarized and demonstrated in Section 0 




Fig. 3. Graph model of Bezier point layout dependencies for the train graph 
of Fig. |l(b)[ Note that there is no binding between the two layout elements 
indicated by black rectangles, because their distances from the anchor differ too 
much (threshold parameter T2) 
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4 Experiments 

In order to obtain an objective function, we experimented with different poten- 
tials and parameters. We started with a simple combination of repelling forces 
from stations and attracting and repelling forces from partners and anchors. 
In fact, we first used splines to represent transitive edges. It seemed that they 
offered better control, since they actually pass through their control points. How- 
ever, segments between partners tended to extend far into the layout area. After 
replacing splines by Bezier curves, the promising results encouraged us to try 
more elaborate objective functions. They showed that it is useful to represent 
long transitive edges straight-line, which led to the introduction of threshold ri . 
A new requirement we found after looking at earlier examples was that incident 
(consecutive or nested) transitive edges should lie on one side of a path of mini- 
mal edges. Binding proved to achieve this goal, but needed to be constrained to 
control segments of similar desired length using a threshold T 2 . Otherwise, short 
transitive edges are deformed when bound to a long one. 

For convenience, we use the final combination of potentials and different 
choices of 6* = (pi, g 2 , Ai, A2,/3, n, T 2 ) to demonstrate the effect of single pa- 
rameters in Fig. 21 In particular. Fig. 14(d)| shows why binding is a valuable 
refinement. The following table summarizes these effects: 



Qi 

Q2 

Ai 

A2 

/3 

n 

T2 

ei 

C2 



controls 

distance of Bezier points from stations 

mutual distance of Bezier points 

length of control segments 

length of bands 

importance of binding 

threshold for straight transitive edges 

threshold for binding segments of different length 

major axis radius of neighborhood defining ellipse 

minor axis radius of neighborhood defining ellipse 



Next to a choice that proved appropriate (Fig. |4(a)|l, it is clearly seen how 
increased repelling forces spread Bezier points (Figs. |4(b]| and |4(c)|). W ithout 
binding, curves tend to lie on different sides of minimal edges (Fig. 14(d‘]t . This 
can even be enforced (Fig. |4(e)'l ). 

The identification of a suitable set of parameters is a serious problem. Men- 
donga and Fades use two nested simulated annealing computations to identify 
parameters of a spring embedder variant |0|. In 0, a genetic algorithm is used 
to breed a suitable objective function. However, both methods are heuristic in 
defining their objective as well as in optimizing it. Given one or more examples 
which are considered to be well done (e.g. by manual rearrangement), a theoret- 
ically sound approach would be to carry out parameter estimation for random 
variable X{9) describing the layout model as a function of parameter vector 9. 
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(a) Regular 

e = (0.3, 0.7, 0.7, 0.5, 0.4, 100, 2.2) 




(b) Station repulsion 
e = (5, 0.7, 0.7, 0.5, 0.4, 100,3) 




(d) No binding 
0 = (0.3, 0.7, 0.7, 0,0, 100,0) 




(c) Segment stretching 
e= (0.3,4, 1,0.5, 0.4, 100,3) 




(e) Inverse binding 
e = (0.3,0.7,0.7,2,1,100,3) 



Fig. 4. Effects of single parameters. For a better comparison, control segments 
are shown instead of the corresponding Bezier curves. All examples have ei = 1.1 
and C 2 = 0.5 
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Given a layout x, the likelihood of 0 is 

P(X = xj0) = -^exp{-U(xl0)} 



where Z(0) = 1 ^)} normalizing constant. A maximum 

likelihood estimate 0* is obtained by maximizing the above expression with re- 
spect to 0. Unfortunately, computation of Z(0) is practically intractable, since 
it sums over all possible layouts. One might hope to reduce computational de- 
mand by exploiting the locality of random fields (see e.g. mi). Even though 
neighboring layout elements are clearly not independent, reasonable estimates 
are obtained from the pseudo-likelihood function 



n 

IGL 




with Zi(0) = exp{— X)cGC-JeC \ 6*)}, where x equals x with xi re- 

placed by yi - However, Z;(0) is a sum over all possible positions of layout element 
I, such that maximization is still intractable in this setting. So we exploited lo- 
cality in a very different way, namely by experimenting with small examples as in 
Fig. El The parameters 0 thus identified proved appropriate, because the model 
scales so well. 

To carry out the above experiments, and to generate large examples, we used 
an implementation of a fairly general random field layout module. It contains a 
set of fundamental neighborhood types and interaction potentials, to which oth- 
ers can be added. Since current concern is on flexibility and model design, a sim- 
ple simulated annealing approach is used for energy minimization. Clearly, faster 
and more stable methods can be employed just as well. The original datasets 
provided by TLC/EVA are quite large. A train graph of roughly 2,000 vertices 
and 4,000 edges, for instance, is built from about 11 MByte of time table data. 
Connections are then classified into minimal and transitive edges. Existing code 
was used for these purposes. 

The first example is shown in Fig. The graph contains regional trains in 
south-west Germany. Edge classification, transformation into a layout graph, 
neighborhood generation, and layout computation took less than two minutes. 
Figs. |5(b'y| and show magnified parts of the drawing from Fig. |5(a)| Verify 
that connections can be told apart quite well, and that binding causes incident 
(consecutive or nested) transitive edges to lie on the same side of minimal edges. 

Larger examples are given in Figs. 0 and0 Computation times were about 
35 minutes and 90 minutes, respectively, most of which was spent on generating 
the graph layout model and determining neighborhoods. One readily observes 
that the algorithm scales very well, i.e. increased size of the graph does not reduce 
layout quality on more detailed levels. This is largely due to the fact that neigh- 
borhoods remain fairly local. Together with the ability to zoom into different 
regions, data exploration is well supported. The benefits of a length threshold 
for curved transitive edges is another straightforward observation, notably in 
Fig. |7(a)| 
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0 vertices, 
L is into t. 
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(c) Strasbourg 



(b) Paris (note the long-distance sta- 
tions!) 



Fig. 7. French connections: ca. 4,500 vertices, 7,800 edges (2,500 transitive), 
e = (0.7, 0.3,0.7,0.5,0.4,100,3) 
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Abstract. Preserving the “mental map” is major goal of interactive 
graph drawing algorithms. Several models have been proposed for for- 
malizing the notion of mental map. Additional work needs to be done to 
formulate and validate “difference” metrics which can be used in practice. 
This paper introduces a framework for defining and validating metrics 
to measure the difference between two drawings of the same graph. 



1 Introduction 

Graph drawing algorithms have traditionally been developed using a batch model, 
where the graph is redrawn from scratch every time a drawing is desired. These 
algorithms, however, are not well suited for interactive applications, where the 
user repeatedly makes modifications to the graph and requests a new drawing. 
When the graph is redrawn it is important to preserve the look (the user’s “men- 
tal map”) of the original drawing as much as possible, so the user does not need 
to spend a lot of time relearning the graph. 

The problems of incremental graph drawing, where vertices are added one 
at a time, and the more general case of interactive graph drawing, where any 
combination of vertex/edge deletion and insertion is allowed at each step, have 
been starting to receive more attention. See, for example, 0, 011011 HI cni 
ini Da EH, E2|. However, while the algorithms themselves have been motivated 
by the need to preserve the user’s mental map, much of the evaluation of the 
algorithms has so far focused on traditional optimization criteria such as the area 
and the number of bends and crossings (for example, [2j tTjll If 9j). Mental map 
preservation is often achieved by attempting to minimize the change between 
drawings — typically by allowing only very limited modifications (if any) to 
the position of vertices and edge bends in the existing drawing — making it 
important to be able to measure precisely how much the look of the drawing 
changes. Animation can be used to provide a smooth transition between the 
drawings and can help compensate for greater changes in the drawing, though it 
is still important to limit, if not minimize, the difference between the drawings 

* Research supported in part by the U.S. Army Research Office under grant DAAH04- 
96-1-0013, by the National Science Foundation under grants CCR-9732327 and CDA- 
9703080, and by a National Science Foundation Graduate Fellowship. 
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(a) (b) (c) 

Fig. 1. The rotation problem, (a) is the user-modified graph (the user’s changes 
are shown in gray); (b) and (c) are the unrotated and rotated InteractiveGiotto 
outputs, respectively. While (b) and (c) are both clearly drawings of the graph 
shown in (a), the resemblance is more readily seen in the properly rotated draw- 
ing (c). 

because if there is a very large change it can become difficult to generate a clear, 
useful animation. It is thus still important to have a measure of how the look of 
the drawing changes. 

Studying “difference” metrics to measure how much a drawing algorithm 
changes the user’s mental map has a number of benefits, including 

— providing a basis for studying the behavior of constraint-based interactive 
drawing algorithms like InteractiveGiotto U, where meaningful bounds on 
the movement of any given part of the drawing are difficult to obtain, 

— providing a technique to compare the results of different interactive drawing 
algorithms, and 

— providing a goal for the design of new drawing algorithms by identifying 
which qualities of the drawing are the most important to preserve. 

Finding a good difference metric also has an immediate practical benefit, namely 
solving the “rotation problem” of InteractiveGiotto. Giotto the core of Inter- 
activeGiotto, does not take into account the coordinates of vertices and bends in 
the original drawing when constructing a new drawing — and, as a result, the 
output of InteractiveGiotto may be rotated by a multiple of 90 degrees (Figure Q). 
The problem can be solved by computing the value of the metric for each of the 
possible rotations and choosing the rotation with the smallest value. 

Several models have been proposed for formalizing the notion of mental 
map |Bl OI ^|, though more work needs to be done to formally define po- 
tential difference metrics and then experimentally validate (or invalidate) them. 
Validation can be via user studies similar to those done in pn ITTI to evaluate 
the impact of various graph drawing aesthetics on human understanding. 

Motivated by applications to InteractiveGiotto, this paper will focus primarily 
on difference metrics for orthogonal drawings, though many of the metrics can be 
used without modification for arbitrary drawings. Section|^ describes the metrics 
being proposed. Section 0 presents a framework for evaluating the suitability of 
the metrics along with a preliminary evaluation, and Section 2] outlines plans for 
future work. 
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2 Metrics 

The difference metrics being proposed fall into five categories: distance, proxim- 
ity, orthogonal ordering, shape, and topology. While the distance category could 
be considered a subset of proximity, it is kept separate to distinguish between 
metrics using the Euclidean distance between points and those using relative or- 
derings based on the distances. Proximity, ordering, and topology are suggested 
in |Hl El Ej as qualities which should be preserved; distance, suggested in H3|, 
and shape reflect intuition about what causes drawings to look different. Within 
each category the metrics were chosen to capture intuition about what qualities 
of the drawing are important to preserve. 

It should be noted that all of the metrics are concerned only with the geo- 
metric aspects of the drawing; other features such as node color and line styles 
are also very important in preserving the look of the drawing and may be able 
to at least partially compensate for geometric changes. 

2.1 Preliminaries 

Terminology Every metric compares two drawings D and D' of the same 
graph G. Each object of the graph G can be associated with two sets of coor- 
dinates, one describing the position in D and the other the position in D' . A 
matched set of objects is a set of the pairs describing the object’s position in the 
two drawings. The matched sets used are sets of points and sets of edges. A point 
is any aspect of the graph that has a single coordinate, such as the centers and 
corners of vertices. The matched point set Pm is the set of pairs (pi,p'i) where pi 
is the location of point i in D and p'^ is the location of point i in D' . Let d(j>, q) 
be the (Euclidean) distance between two points p and q. An edge is simply an 
edge of a graph; the important features are the coordinates of the endpoints and 
bends. The matched edge set Em is the set of matched edges (ei,e') where is 
the edge i in D and e' is the edge i in D' . 

Drawing Alignment Most of the metrics compare coordinates between draw- 
ings, which means that they are sensitive to the particular values of the coor- 
dinates. Figure E| illustrates this sensitivity — if the difference metric relies on 
the distance d{pi,pi) between points, adjusting the scale and translation of one 
point set relative to the other makes a large difference. 

To eliminate this effect, the drawings are aligned before the metric is com- 
puted. This is done by extracting a (matched) set of points from the drawings 
and applying a point set matching algorithm to obtain the best fit. In general the 
matching algorithm should take into account scaling, translation, and rotation, 
though it may be possible to eliminate one or more of the transformations for 
certain metrics or if something is known about the relationship between the two 
drawings. For example, interactive drawing algorithms often preserve the rota- 
tion of the drawing (see j3[^ for examples), eliminating the need to consider 
rotation in the alignment stage. For orthogonal drawings, there are only eight 
possible rotations for the second drawing relative to the first — four multiples 
of 90 degrees, applied to the original drawing and its reflection about the x axis. 
These eight possibilities can be handled by computing the metric separately for 
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Fig. 2. Two point sets (black and gray) superimposed. (Corresponding points in 
the two sets are connected with dotted lines.) As shown, the Euclidean distance 
metric (Section 12.211 would report a distance of 4.25. However, translating the 
gray points one unit to the left and then scaling by 1/2 in the x direction allows 
the point sets to be matched exactly, for a distance of 0. It should be noted that 
exact matches are not possible in general. 

each rotation and taking the minimum value instead of incorporating rotation 
into the alignment process. 

A great deal of work has been done on point set matchings; see PEm for 
several methods of obtaining both optimal and approximate matchings. Differ- 
ent methods can be applied when the correspondence between points is known 
as it is here; see m for an algorithm that minimizes the maximum distance 
between corresponding points under translation, rotation, and scaling. In the 
implementation used in Section 0 the alignment is performed by using gradient 
search to minimize the distance squared between points. 

Point Set Selection Points can be selected in a number of ways. Two vertex- 
centered methods — centers and corners — are used here, to reflect the idea that 
vertex positions are a significant visual feature of the drawing HZl- “Centers” 
consists of the center points of each vertex; this captures how vertices move. 
“Corners” uses the four corners of each vertex, taking into account both vertex 
motion (the movement of the center) and changes in the vertex dimension. It 
seems important to take into account changes in vertex dimension because a 
vertex with a large or distinctive shape can act as a landmark to orient the user 
to the drawing; loss of that landmark makes orientation more difflcult. 

2.2 Distance 

The distance metrics reflect the simple observation that drawings that look very 
different cannot be aligned very well, and vice versa. In order to make the value 
of the distance metrics comparable between pairs of drawings, they are scaled 
by the graph’s unit length u. For orthogonal drawings the unit length can be 
computed by taking the greatest common divisor of the Manhattan distances 
between vertex centers and bend points on edges. Non-orthogonal portions of 
the drawing, such modifications of an orthogonal drawing made by the user, can 
be ignored during the computation. While the determination of the unit length 
will be unreliable if a small portion of the drawing is orthogonal, scaling by 
the unit length is not necessary in some applications (e.g., solving the rotation 
problem of InteractiveGiotto) and can often be supplied manually if it is required. 
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Fig. 3. Proximity: (b) looks more like (a) than (c) does because the relative 
shape of both the inner and outer squares are preserved even though the distance 
(using the Euclidean distance metric) between (c) and (a) is smaller. An aligned 
version of the vertices of (a), used in the computation of the distance metric, is 
shown with dotted lines in (b) and (c). 



Hausdorff Distance The Hausdorff distance is a standard metric for deter- 
mining the distance between two point sets, and measures the largest distance 
between a point in one set and its nearest neighbor in the other. 

The (undirected) Hausdorff distance for a matched point set Pm is 

hausdorff( Pm ) = — max{ max min d {pi , p) ) , max min d (p( ,Pj)} 
u i 3 i 3 



Euclidean Distance Euclidean distance, introduced in m, is a simple metric 
measuring the average distance moved by each point from the first drawing to 
the second; it is motivated by the notion that if points move a long way from 
their locations in the first drawing, the second drawing will look very different. 



dist(D, D') 



(PiPO&Pm 



2.3 Proximity 

The proximity metrics reflect the idea that points near each other in the first 
drawing should remain near each other in the second drawing. This is stronger 
than the distance metrics because it captures the idea that if a subgraph moves 
relative to another (but there are no changes within either subgraph), the dis- 
tance should be less than if each point in one of the subgraphs moves in a different 
direction (Figure 0- 

Three different metrics are used to try to capture this idea: nearest neighbor 
within (nn- within), nearest neighbor between (nn-between), and e-clustering. 

Nearest Neighbor Within Nearest neighbor within is based on the reasoning 
is that if pj is the closest point to pi in D, then p' should be closest point to p( 
in D' . Considering only distances within a single drawing means that nn- within 
is alignment-independent and thus not subject alignment errors, but means that 
it is not suitable for solving the rotation problem of InteractiveGiotto. 



62 



Stina Bridgeman and Roberto Tamassia 



This metric has two versions, weighted and unweighted. In the weighted ver- 
sion the number of points closer to p[ than pt is considered, whereas in the 
unweighted version it only matters if p' is or is not the closest point. The rea- 
soning behind the weighted version is that if there are more points between p[ 
and p'j, the visual linkage between p' and p' has been disrupted to a greater 
degree and the drawing looks more different. 

In both cases the distance is scaled by the number of points being considered 
and W, the maximum weight contributed by a single point. 

nnw(£), £>') = ^ , p , closer(p',p' ) 

(Pi.P-)GPm 

where pj is the closest point to pi in D and 

closer(p',p') = \{k \ d(p',Pfe) < d(p',p')}| , W = \Pm\ - 2 (weighted) 



closer(p',p') 



0 if d(p',p' ) < d(p',p'fc), /c yf i 

1 otherwise 



(unweighted) 



Nearest Neighbor Between Nearest neighbor between is similar to nn-within 
but instead measures whether or not p' is the closest of the points in D' to p 
when the two drawings are aligned. The idea that a point should remain nearer 
to its original position than any other is also the force behind layout adjustment 
algorithms based on the Voronoi diagram H3- 



nnb(Z?, D') 



1 

W\Pm\ 



closer(p,,p') 

(Pi.P')GPm 



where 

closer(p*,p') = \{j I d(p*,p') < d(p^,p')}| , W = \Pm\ - 1 (weighted) 



closer(pi,p() = I J ^ d{p^,pj) ^ ^ ^ ^ (unweighted) 

Unlike nn-within, nn-between is not alignment- and rotation-independent 
and thus is suitable for solving the rotation problem. 

e- Clustering An e-cluster for a point p is the set of points q such that d(p, q) < e, 
where a reasonable value to use for e is (see |H|) 

e = max min d(p, q) 

p q^p 

The e-cluster metric measures how the e-cluster for pi compares to that for p'. 
Let C be the graph where the vertex set is the set of points and {i,j) G Ec 
if d{pi,pj) < CD or d(p',p') < cd'- Let Ecd be the set of edges for which the 
first condition holds and be the set of edges for which the second condition 
holds. The distance is thus 

Ecd ri Ec^, I 
\Ec\ 



cluster(D, D') = 1 
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(a) (b) (c) 



Fig. 4. Orthogonal ordering: Even though the angle the vertex moves relative to 
the center of the large vertex is the same from (a) to (b) and from (a) to (c), the 
perceptual difference between (a) and (c) is much greater. The original location 
of the vertex is shown with a dotted box in (b) and (c) for comparison purposes. 



2.4 Orthogonal Ordering 

The orthogonal ordering metric reflects the desire to preserve the relative order- 
ing of every pair of points — if p is northeast of q in D, p' should remain to 
the northeast of q' in D' PHI]. The simplest measurement of difference in the 
orthogonal ordering is to take the angle between the vectors q — p and q' — p' {un- 
weighted orthogonal ordering) . This has the desirable feature that if q is far from 
p, q' — q must be larger to result in the same angular move, which reflects the 
intuition that the relative position of points near each other is more important 
that the relative position of points that are far apart. 

However, simply using the angular change fails to take into account situations 
such as that shown in Figure 0 This problem can be addressed by introducing 
a weight that depends on the particular angles involved in the move in addition 
to size of the move {weighted orthogonal ordering). 

OTdei{D,D') = ^ |p I X] min(order(%, 6<b), order(6<b, %)) 



where 9ij is the angle from the positive x axis to the vector pj — pi , 0b is the 
angle from the positive x axis to the vector p) — p', and 



order(0y,0b) = 



weight (0) d0, 



W = min 



The weight functions are 



2ir 



weight(0) d0, / weight(0) d9 



r-(® mod f) ; 



weight (0) = 



9 mod 



if (0 mod f ) > f 
if (0 mod f ) < f 



(weighted) 



weight (0) = 1 



(unweighted) 



The A-matrix model for measuring the difference of two point sets, introduced 
in HSj, is based on the concept of order type of a point set nni. This model tries 
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(a) (b) 

Fig. 5. Shape: (a) and (b) look different even though the graphs are the same 
and the vertices have the same coordinates. 

to capture the notion of the relative position of vertices in a straight-line drawing 
and is thus related to the orthogonal ordering metric. 



2.5 Shape 

The shape metric is motivated by the reasoning that edge routing may have an 
effect on the overall look of the graph (Figure EJ. The shape of an edge is the 
sequence of directions (north, south, east, and west) traveled when traversing 
the edge; writing the shape as a string of N, S, E, and W characters yields the 
shape string of the edge. For non-orthogonal edges the direction is taken to be 
the most prominent direction; for example, if the edge goes from (1,1) to (4,2) 
the most prominent direction is east. For each edge the minimum number of 
edits to transform the shape string in one drawing to the string in the other is 
computed, where an edit consists of inserting, deleting, or replacing a character 
in the shape string. The metric is the average number of edits per edge. 



shape(Zl, D') 



1 

J^\ 



edits(ci,e') 



Shape is scale- and translation-independent. 



2.6 Topology 

The topology metric reflects the idea that preserving the order of edges around 
a vertex is important in preserving the mental map |E1 — comparing the 

drawing produced by Giotto in Figures El and Q to the user’s input illustrates 
this. However, since most interactive orthogonal drawing algorithms (see, for 
example, gl 13 El ESI ESI) always preserve topology, it is not useful as a means 
of comparing these algorithms. It is also alignment-independent and so can not 
be used to solve the rotation problem of InteractiveGiotto. As a result, it is not 
discussed in any more detail here. 
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3 Analyzing the Metrics 

Once defined, the suitability of the metrics must be evaluated. A good metric 
for measuring the difference between drawings should satisfy the following three 
requirements: 

— it should qualitatively reflect the visual difference between two drawings, i.e. 
the value increases as the drawings diverge; 

— it should quantitatively reflect the visual difference so that the magnitude of 
the difference in the metric is proportional to the perceived difference; and 

— in the rotation problem of InteractiveGiotto, the metric should have the small- 
est value for the correct rotation, though this requirement can be relaxed 
when the difference between drawings is high since in that case there is no 
clear “correct” rotation. 

The third point is the easiest to satisfy — in fact, most of the metrics defined 
in the previous section can be used to solve the rotation problem — but is still 
important worth considering since the problem was one of the factors that first 
inspired this work. 

Evaluating the qualitative and quantitative behavior of potential metrics 
requires ranking pairs of drawings based on the visual difference between the 
existing drawing and the new drawing in each pair. InteractiveGiotto provides 
a convenient way of obtaining an ordered set of drawings because it allows the 
constraints preserving the layout to be relaxed. By default InteractiveGiotto pre- 
serves edge crossings, the direction (left or right) and number of bends on an 
edge, and the angles between consecutive edges leaving a vertex. Recent modi- 
fications allow the user to turn off the last two constraints on an edge-by-edge 
or vertex-by-vertex basis. This means that it is possible to produce a series of 
drawings which are progressively farther from the original by iteratively relaxing 
more of the constraints. A smooth way of relaxing the constraints is to use a 
breadth-first ordering, expanding outward from the user’s modifications — in 
the first step all of the constraints are applied, in the second step the bend and 
angle constraints are relaxed for all of the modified objects, in the third step 
the angle constraints are relaxed for all vertices adjacent to edges whose bends 
constraints have been relaxed, in the fourth step the bend constraints are relaxed 
for all edges adjacent to vertices whose angle constraints have been relaxed, and 
so on, alternating between angle and bend constraints until all of the constraints 
have been relaxed. This relaxation method is based on the idea that the user 
is most willing to allow restructuring of the graph near where her changes were 
made, so progressively expanding the sphere of influence of the changes results 
in a series of drawings in which the mental map is increasingly disrupted. 

Figures El and El show two such relaxation sequences; the base graphs and 
user modifications are those used in the first two steps of Figure 2 in Giotto’s 
redraw-from-scratch drawing of the graph is also included for comparison. Fig- 
ures 0 and El show the results of the difference metrics for each sequence of 
drawings. 
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Fig. 6. Relaxations for stage 1. (a) shows the user’s modifications, (b)-(e) show 
the output from InteractiveGiotto for successive relaxations, and (f) is the output 
from Giotto. Rounded vertices and bends without markers indicate vertices and 
bends for which the constraints have been relaxed. 

Some preliminary work has been done on evaluating the proposed metrics 
based on their qualitative behavior. Six sets of graphs consisting of a base graph 
modified by several successive sets of modifications were generated from two 
applications where interactive graph drawing algorithms are useful — mapping 
how a user might explore topics when querying a search engine and mapping 
the exploration of a web site by a user or web crawler. Two other sets were 
generated from two of the same sets by breaking the modifications generated by 
each of the user’s steps into single changes which were applied individually. A 
total of 62 graphs were generated in this way; for each a series of drawings was 
produced using InteractiveGiotto with progressively relaxed constraints. Giotto 
was also run on each graph. 
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(j) (k) 



Fig. 7. Relaxations for stage 2. (a) shows the starting graph, (b) shows the user’s 
modifications (two vertices and their adjacent edges deleted), (c)-(j) show the 
output from InteractiveGiotto for successive relaxations, and (k) is the output 
from Giotto. Rounded vertices and bends without markers indicate vertices and 
bends for which the constraints have been relaxed. 



The initial analysis shows that when the user’s modifications are small (af- 
fecting only a single vertex or edge) most of the metrics are well-behaved — 
that is, they increase in value as the constraints are relaxed, with the highest 
value being for the Giotto drawing. When the size of the changes is increased, 
shape. Euclidean distance, and to some degree clustering tend to remain the 
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(a) 



Values of metrics for each drawing in stage 1. 
Drawings (b)-(f) are shown from front to back. 




°ogc5?gl<-i : T^: 



(b) Values of metrics for each drawing in stage 2. 

Fig. 8. Striped bars indicate entries for which the metric reported the wrong 
rotation; white outlines indicate that multiple rotations had the same small- 
est value for the metric. The label for each metric is of the form Kmetric 
name> /<weighted> /-/<alignment> /<points> , where <weighted> is t or f in- 
dicating if the weighted version of the metric was used (if applicable), and 
<alignment> and <points> are strings indicating the choice of points used for 
alignment and points, respectively, c- indicates using vertex centers, -c- indi- 
cates vertex corners, and the last position (t or f) indicates whether the edges 
and vertices modified by the user were considered. 
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Fig. 9. HausdorfT and Euclidean distance metrics for stages 1 and 2, respectively. 

most well-behaved. Further study is needed to determine why these effects are 
being observed and if they are indeed typical of the metrics. 

4 Future Work 

As mentioned, the natural next step is to perform more extensive experimen- 
tation using the relaxation sequences produced by InteractiveGiotto from both 
real-world and randomly generated “typical” graphs. The randomly generated 
graphs can be created using a method similar to that used in |Z] to produce 
realistic graphs for the experimental analysis of several orthogonal drawing al- 
gorithms. 

The sequences produced by InteractiveGiotto allow a qualitative assessment 
of the metrics — it is easy to see visually that the difference between the base 
drawing and the relaxed drawing increases as the constraints are relaxed — but 
do not allow for a very good quantitative assessment because there is only a 
relative ordering of the drawings within a sequence and there is no ordering of 
drawings from different sequences relative to each other. User studies can be used 
to identify the importance of factors such as distance between points, proximity, 
orthogonal ordering, and shape in the user’s mental map and assign difference 
values to each drawing, which can then be compared to the values of the metrics. 

Once suitable metrics have been identified and validated through user stud- 
ies, they can be used to compare the behavior of interactive graph drawing algo- 
rithms as well as potentially providing inspiration for new drawing algorithms. 
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Abstract. In this paper we look at upward planarity from a new per- 
spective. Namely, we study the problem of checking whether a given 
drawing is upward planar. Our checker exploits the relationships between 
topology and geometry of upward planar drawings to verify the upward 
planarity of a significant family of drawings. The checker is simple and 
optimal both in terms of efficiency and in terms of degree. 

1 Introduction 

The intrinsic structural complexity of the implementation of geometric algo- 
rithms makes the problem of formally proving the correctness of the code unfea- 
sible in most of the cases. This has been motivating the research on checkers. A 
checker is an algorithm that receives as input a geometric structure and a predi- 
cate stating a property that should hold for the structure. The task of the checker 
is to verify whether the structure satisfies or not the given property. Here, the 
expectation is that it is often easier to evaluate the quality of the output than 
the correctness of the software that produces it. Several authors f2JE,IS| agree 
on the basic features that a “good” checker should have: 

Correctness: The checker should be correct beyond any reasonable doubt. Oth- 
erwise, one would fall into the problem of checking the checker. 

Simplicity: The implementation should be straightforward. 

Efficiency: The expectation is to have a checker that is not less efficient than 
the algorithm that produces the geometric structure. 

Robustness: The checker should be able to handle degenerate configurations 
of the input and should not be affected by errors in the flow of control due 
to round-off approximations. 

* Research supported in part by the ESPRIT LTR Project no. 20244 - ALCOM-IT 
and by the CNR Project “Geometria Computazionale Robusta con Applicazioni alia 
Grafica ed al CAD.” 
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Checking is especially relevant in the graph drawing context. In fact, graph 
drawing algorithms are among the most sophisticated of the entire computational 
geometry field, and their goal is to construct complex geometric structures with 
specific properties. Also, because of their immediate impact on application areas, 
graph drawing algorithms are usually implemented right after they have been 
devised. Further, such implementations are often available on the Web without 
any certification of their correctness. Of course, the checking problem becomes 
crucial when the drawing algorithm deals with very large data sets, when a 
simple complete visual inspection of the drawing is difficult or unfeasible. 

Devising graph drawing checkers involves answering only apparently innocent 
questions like: “is this drawing planar?” or “is this drawing upward?” or “are the 
faces convex polygons?” . The problem of checking the planarity of a subdivision 
has been pioneered in uaini. In those papers linear time algorithms are given to 
check the planarity of a subdivision composed by convex faces. The inputs are 
the subdivision plus its topological embedding in terms of the ordered adjacency 
lists of the edges. Unfortunately, extending the above techniques to checking 
the planarity of a subdivision whose faces are not constrained to be convex, 
relies on the usage of algorithms for testing the simplicity of a polygon. The 
only general linear time algorithm known for this problem is the fairly complex 
algorithm in 0. Hence, devising a checker based on such algorithm would not 
satisfy the simplicity requirement. The algorithm in P| tests the simplicity of a 
polygon by means of an intermediate triangulation step. Alternative algorithms 
that can triangulate in linear time special classes of polygons have been devised. 
See e.g. 0. Other almost optimal algorithms can be found in [nnin] 

In this paper we study the problem of checking the upward planarity of a 
drawing. Upward planarity is a classical topic in graph drawing and several 
papers deal with the problem of testing whether a given graph has an upward 
planar drawing and eventually constructing it. For an overview, see 0. We look 
at the problem from a different perspective. The main results of this paper are: 

(i) We introduce and study regular wpwa,rd planar embeddings. We show that 
such embeddings coincide with those that have a “unique” including planar st- 
digraph. There are several families of digraphs whose upward planar embeddings 
are always regular. E.g. rooted trees, planar st-digraphs, and planar sT-digraphs 
(i.e. single-source digraphs). The great majority of algorithms for constructing 
upward planar drawings receive as input such digraphs, (ii) We exploit the con- 
cept of regularity to investigate the relationships between topology and geometry 
of upward planar drawings. In particular, we show that an upward drawing of a 
regular planar upward embedding satisfies strong constraints on the left-to-right 
ordering of the edges, (ii) Based upon the above results and under the assump- 
tion of regularity we present a linear time checker to test whether a given drawing 
r is upward planar. Our checker receives as input the set of vertices and bends 
of r (represented as pairs of integer coordinates), the set of oriented edges of T, 
and the embedding of F, i.e. the circular ordering of the edges incident on each 
vertex of F. An example of a drawing whose upward planarity can be checked by 
our algorithm is shown in Figure [D (iii) We further analyze the effectiveness of 
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our checker by adopting the notion of degree which takes into account the arith- 
metic precision required by the checker to carry out error-free computations. We 
show that our checker has (optimal) degree 2. 




Fig. 1. Example of drawing whose upward planarity can be checked with the 
algorithm presented in this paper. 



Our techniques do not exploit the polygon simplicity perspective but rely on 
the relationships between topology and geometry. Namely, in general triangulat- 
ing the faces of a drawing does not appear to be strictly necessary when testing 
its planarity; also, the faces of an upward drawing can be very different from 
those polygons for which a simple triangulation algorithm is known. 

2 Preliminaries 

We first recall the notion of algorithmic degree, then we recall basic definitions 
and properties of upward planar drawings and embeddings. We assume famil- 
iarity with basic graph and geometric terminology. See also HU IE]. 

The notion of degree as a measure of the precision that can be required by an 
error-free implementation of an algorithm has been introduced in 
briefly recall some terminology and results. 

The numerical computations of a geometric algorithm are basically of two 
types: tests (predicates) and constructions. Tests are associated with branching 
decisions in the algorithm that determine the flow of control. Constructions are 
needed to produce the output data of the algorithm. While approximations in 
the execution of constructions are acceptable, provided that the error magnitude 
does not exceed the resolution required by the application, approximations in 
the execution of tests may produce an incorrect branching of the algorithm, thus 
giving rise to structurally incorrect results. The exact-computation paradigm 
therefore requires that tests be executed with total accuracy. 

Geometric and graph drawing algorithms can be therefore analyzed on the 
basis of the complexity of their test computations. Any such computation con- 
sists of evaluating the sign of an algebraic expression over the input variables. 
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constructed using an adequate set of operators, such has , x,-7, This 
can be reduced to the evaluation of the signs of multivariate polynomials derived 
from the expression. 

A primitive variable is an input variable of the algorithm and has conven- 
tional arithmetic degree 1. The arithmetic degree of a polynomial expression E 
is the common arithmetic degree of its monomials. The arithmetic degree of a 
monomial is the sum of the arithmetic degrees of its variables. An algorithm has 
degree d if its test computations involve the evaluation of multivariate polyno- 
mials of arithmetic degree at most d. A problem has degree d if any algorithm 
that solves the problem has degree at least d. 

A straightforward consequence of a result in JS] is the following. 

Theorem 1. The upward planarity checking problem has degree at least 2. 

We borrow some terminology and results from B I2j Let G be a planar 
digraph. An upward drawing of G is a drawing such that all edges are repre- 
sented by curves monotonically increasing in a common direction, for example 
the vertical one. A digraph that admits an upward planar drawing is upward 
planar. 

An st-digraph is an acyclic digraph with exactly one source s and exactly one 
sink t and such that s and t are adjacent. An sT- digraph is an acyclic digraph 
with exactly one source s. 

Lemma 1. [13|, An upward planar digraph is a subgraph of a planar st- 
digraph. 

Let G be an embedded planar digraph. A vertex of G is bimodal if its incident 
list can be partitioned into two possibly empty linear lists, one consisting of 
incoming edges and the other consisting of outgoing edges. If all its vertices are 
bimodal then G and its embedding are called bimodal. A digraph is bimodal if it 
has a planar bimodal embedding. 

Let / be a face of a bimodal digraph G. Visit the contour of / counterclock- 
wise (i.e. such that the face remains always to the left during the visit). A vertex 
V of f with incident edges ei and C 2 is a switch if the the direction of e\ is 
opposite to the direction of 62 (note that ei and 62 may coincide if the digraph 
is not biconnected) . If ei and 62 are both incoming (outgoing) u is a sink switch 
{source switch) of /. Let 2ny be the number of switches of /. The capacity c/ of 
/ is defined to be n/ — I if / is an internal face and n/ -I- 1 if / is the external 
face. 

An assignment of the sources and sinks of G to its faces such that the following 
properties hold is upward consistent: (i) A source (sink) is assigned to exactly 
one of its incident faces, (ii) For each face /, the number of sources and sinks 
that are assigned to / is equal to c/. 

Theorem 2. Let G be an embedded bimodal digraph; G is upward planar if 
and only if it admits an upward- consistent assignment. 
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Let G be an embedded bimodal digraph that has an upward-consistent as- 
signment. According to Theorem |2 in 0 it is defined the concept of upward 
planar embedding of G as its planar embedding for which, for each face /, the 
switches of / are labeled S or L. A switch is labeled L if it is a source or a 
sink assigned to /, S otherwise. If / is internal, then the number of its switches 
labeled L is c/ — 1, else the number of its switches labeled L is Cf + 1. A face 
of G with the above properties is upward consistent. The circular list of labels 
of / will be usually called the labeling of / and denoted as tx/. Also, So-f (To-y) 
denotes the number of S'-labels (L-labels) of af. 

Observe that an embedded bimodal digraph can have many upward planar 
embeddings each corresponding to a certain upward-consistent assignment. 

Property 1. [2j For an upward consistent internal (external) face / we have: 

Suf = Luj + 2 (To-/ = Snj + 2). 

An immediate consequence is the following. 

Property 2. The labeling cr/ of an upward consistent internal (external) face / 
has at least two consecutive S'-labels (L-labels). 

Another consequence of the results in 0 is that, given a planar upward 
embedding of a digraph G it is always possible to construct a planar st-digraph 
including G by adding to G a new source s, a new sink t, edge (s,t), and a 
suitable set of (dummy) edges. Such edges connect either pairs of switches or 
external face switches with s or t. More formally, we can elaborate the concepts 
in 0 as follows. Given an upward planar embedded digraph G a saturator of G 
is a set of edges (each edge a saturating edge) plus two vertices s and t connected 
by edge (s,t). A saturating edge is such that: 

— A saturating edge can either connect two switches of the same face, or it can 
connect a sink switch labeled L of the external face to t, or it can connect s 
to a source switch labeled L of the external face. 

— For a saturating edge (u, v), u,v ^ s, t, either u is a source switch labeled S 
and u is a source switch labeled L or u is a sink switch labeled L and v is 
a sink switch labeled S. In the former case we say that u saturates v and in 
the latter case we say that v saturates u. 

— The faces obtained with the insertion of a saturating edge are upward con- 
sistent. 

Examples of upward planar embeddings can be found in FigureEl The dashed 
edge of Figure 0c) is a saturating edge. 

The set of saturating edges added to a face is also called saturator of that 
face. We mainly focus on properties of saturators of internal faces. Analogous 
properties hold for the external face. The following property relates the labeling 
of an internal face / to the labeling of the faces obtained when inserting a 
saturating edge. 
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Fig. 2. Examples of labelings, faces, and digraphs: (a) A planar upward embed- 
ded rooted tree, (b) A planar upward embedded st-digraph. (c) A planar upward 
embedded sT-digraph. The dashed edge is a saturating edge, (d) A more complex 
planar upward embedded digraph. 



Property 3. Let / be an internal face of an upward planar embedded digraph, let 
e be a saturating edge of /, and let a f = aiSa 2 L, where the S and L in evidence 
are the labels of the endvertices of e. Then, Sa^ = Lo-i + 1 and £'^2 = ^ 0-2 + 1- 

Proof. By the definition of saturator, both the faces resulting by the insertion 
of e are upward consistent. Observe that such faces have labeling tJiS' and Sa 2 - 
Hence, by Property [Owe have that Sa-^ = La-^ -h 1 and Sa -2 = + 1. 

A saturator of G is said to be complete if for every face / and for every switch 
It of / labeled L, u is an endvertex of an edge of the saturator. 

Property 4- Each upward planar digraph G is a subgraph of a planar st-digraph 
constructed by adding to G a complete saturator of an upward planar embedding 
of G. 

Observe that an upward planar embedding can have, in general, many com- 
plete saturators. 

Property 5. Each planar upward drawing of a digraph G is a subdrawing of a 
planar upward drawing of a planar st-digraph enclosing G and constructed by 
adding to G a complete saturator. 

3 Regular Upward Embeddings 

In this section we characterize the family of upward planar digraphs for which 
there exists a unique complete saturator. Also, we study the relationships be- 
tween topological properties and upward drawings of such digraphs. 

3.1 Regularity 

Our characterization is based on a certain type of labeling. Namely, let G be an 
upward planar embedded digraph. An internal face / of G has a regular labeling 
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if (7/ does not contain two distinct maximal subsequences cti and U 2 of iS-labels 
such that S'o-i > 1 and Sa^ > 1- An external face / of G has a regular labeling if 
(7/ does not contain two consecutive S'-labels. A face of G with a regular labeling 
is a regular face. For example all labelings of Figure E] are regular. 

Property 6. In the labeling of a regular internal face / with more than two 
switches, there is always a maximal subsequence of at least three consecutive 
S'-labels. 

Proof. By contradiction. Suppose that the labeling of the face / is ct/ = a\LSSL. 
By Property P we have Sc^ = L„^+2. Therefore, a\ has two consecutive S-labels 
which implies the non regularity of the face. 

Based on the notion of regular face, a family of upward planar embedded 
digraphs can be defined. An upward planar embedded digraph is regular if all 
its faces have a regular labeling. The corresponding embedding is also called 
regular. An upward planar digraph is regular if all its upward planar embeddings 
are regular. 

The following theorem lists important families of regular digraphs. 

Theorem 3. Rooted trees, planar st-digraphs, and upward planar sT -digraphs 
are regular. 

Proof. We prove the regularity by describing with regular expressions the la- 
beling of the faces of any upward planar embedding. Rooted trees have one 
(external) face in any upward planar embedding. The face is regular since its la- 
beling is LL{LS)*. See, for example, FigureEJa). Concerning planar st-digraphs: 
the external face is labeled LL and the internal faces are labeled SS. See, for 
example. Figure m.h). Upward planar sT-digraphs are such that the external 
face is labeled LL{LS)* and the internal faces are labeled SS{SL)*. 

An example of a more complex regular upward embedded digraph, that does 
not fall in any of the classes mentioned in Theorem El is shown in Figure E^d). 

In SectionElit has been observed that an upward planar embedding can have, 
in general, many complete saturators. The notion of regular embedding allows 
us to characterize those planar upward embeddings that have only one complete 
saturator. We start by studying how a switch labeled L can be saturated in a 
regular face. 

Two switches of a face / of an upward planar embedded digraph are neighbor 
switches if their labels are in the same subsequence cti of ct/ such that all labels 
in a I are S (L) labels. 

Lemma 2 . In a regular face f with more than two switches, if a switch u sat- 
urates a switch V, then u has at least one neighbor switch. 

Proof. Suppose the face is internal. By contradiction. Let the labeling of / be: 
af = GiLSLcf 2 L, where the last L-label is the label of v and the S'-label is the 
label of u. After the insertion of edge (u, v) the two new faces fi and /2 have 
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labeling aiLS and SLcj 2 , respectively. Also, by Property |21 ct/ has at least two 
consecutive S'-labels. Suppose wlog that such consecutive 5'-labels are in cri. It 
follows, by the regularity of cr/, that ct 2 does not have two consecutive 5'-labels. 
However, by Property 0 applied to face /2 this is not possible. 

Lemma 0 restricts the set of possible switches labeled S that can be used to 
saturate the switches labeled L in a regular face. The next lemma allows us to 
further restrict our attention to only one switch of such a set. The lemma holds 
for any (even non-regular) face of an upward planar digraph and will be used in 
its general form to prove a next theorem. 

Lemma 3. Let f be an internal faee of an upward planar embedded digraph, 
sueh that f has more than two switehes, and let u and v be two switehes of f. If 
V can saturate u, then none of the neighbor switches of v can saturate u. 

Proof. Let cry = aia2Sa^a4L be the labeling of / where: S is the label of v, L 
is the label of u, a2 and cr^ are the subsequences of the S'-labels of the neighbor 
switches of v (by Lemma0at most one of such subsequences may be empty), and 
cTi and (74 contain the remaining labels of cry. Clearly, = Lo-g = 0. Therefore, 
since v can saturate u, by PropertyElwe have that Sa-^ + 3^2 = La-^ -I- 1. It follows 
that Sa 2 = L(ti — Sai + 1. Observe that the position of the label of the switch 
saturating u in the sequence 028^^ is univocally determined by the values of 
Lo-i and ■ Hence, there is only one switch of / that can saturate u among v 
and its neighbor switches. 

We are now ready to prove one of the main results of this section. 

Theorem 4. An upward planar embedding has only one complete saturator if 
and only if it is regular. 

Proof. We concentrate on internal faces. Similar arguments hold for the external 
face. Let G be a digraph with a given regular upward planar embedding and let 
/ be an internal face of G. Lemmas 0 and 0 imply that for every switch v 
labeled L of / there exists a unique switch u such that (u, v) is a saturating 
edge. Hence, if an upward planar embedding is regular, then there is only one 
complete saturator. To prove the necessity, we now show that if an internal face 
/ of an upward planar embedded digraph G has a switch u labeled L and two 
distinct switches v and w such that both v and w can saturate u, then / (and 
hence G) is not regular. We have that cry = aiSa-2Sa3L, where the first S is 
the label of v, the second S is the label of w, and L is the label of u. Since v 
(w) can saturate u, the saturating edge (u,v) {{u,w)) splits / into two upward 
consistent faces. Thus, by Property 0 there are two consecutive S labels in o'lS' 
(S'o'a). By Lemma 0u and w are not neighbor switches. Hence, La-2 ^ 1- This 
implies the non-regularity of /. 

The topology of an upward planar embedded digraph G induces ordering 
relationships on its edges, which correspond to a set of geometric constraints 
that have to be satisfied by an upward planar drawing of G. Together with 
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regularity properties, the study of such ordering relationships has revealed to 
be a basic ingredient for the design of our efficient low-degree upward planarity 
checker. 

3.2 Precedence and Dominance 

Let G be a planar embedded st-digraph. Let ei and 62 be two distinct edges of 

i 

G. We say that ei is to the left of €2, and denote it as ei 62, when: 

1. there exists a drawing G of G and two distinct points pi G r{ei) and p2 G 
r{e2) such that y{pi) = y{p2) and x{pi) < x{p2)\ 

2. for any upward planar drawing G of G and for any two distinct points p\ G 
r{e\) and p2 G G(e2) such that y{p\) = y(j>2), we have that x{pi) < x(j>2)- 

The following properties can be easily proved. 

Property 1. For each pair (ui, ui), (zi2, U2) of edges of G, the relationship 

i 

(ui,vi) ~<G (u2,V2) holds if and only if: (1) it does not exist a directed path 
from s to t containing both edges, and (2) there exists a vertex w and two inter- 
nally disjoint directed paths tti and 7T2 such that tti contains (ui, vi), tt2 contains 
(u2, V2), 7Ti and 7T2 share w, and the edge of tti incident on w is to the left of the 
edge of 7T2 incident on w in the ordering of the incoming edges incident on w. 

i . 

Property 8. Let G be a planar embedded st-digraph. Relation -<g is transitive. 

We now define a precedence relationship between two edges of an upward 
planar embedded digraph. Let e\ and 62 be two distinct edges of an upward 

planar embedded digraph G. We say that e\ is to the left of 62, and denote it as 
i 

6i 627 when for each including st-digraph G obtained by adding a complete 

i 

saturator to G we have that e\ -<g' 02- 
From Property 0 it follows that: 

i 

Property 9. Let G be an upward planar embedded digraph. Relationship ^g is 
transitive. 

Let ei and 62 be two distinct edges of an embedded planar st-digraph G. We 

U 

say that ei is dominates 62, and denote it as ei -<g 62, when for any upward 
planar drawing P of G and for any two distinct points pi G G(ei) and p2 € P{e2) 
we have that y{pi) > y{p2)- 

U 

Property 10. For each pair ei,C2 of distinct edges of G e\ ^2 if and only if 
there exists a directed path from s to t containing both e\ and 62 and such that 
when going from s to t along the path e\ is encountered before 62. 
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Property 11 . Let G be a planar embedded st-digraph. Relation is transitive. 

We now define a dominance relationship between two edges of an upward 
planar embedded digraph. Let ei and 62 be two distinct edges of an upward 
planar embedded digraph G. We say that 62 dominates ei, and denote it as 

U 

ei -<G 62, when for each including st-digraph G obtained by adding a complete 

U 

saturator to G we have that ei -<g' ei- 
From Property rm it follows that: 

U 

Property 12 . Let G be an upward planar embedded digraph. Relationship ^g 
is transitive. 



Observe that since relationships ^g and are mutually exclusive for an 
embedded planar st-digraph, the following property holds. 

i 

Property 13 . Let G be an upward planar embedded digraph. Relationships <g 

U 

and <G are mutually exclusive. 



/ U 

Also notice that, because of Property y the relationships and de- 
fined for an upward planar embedded digraph give left-to-right and up-down 
constraint that hold for any drawing of G. 

We are now ready to characterize the upward planar embedded digraphs such 

U ^ I 

that for any pair of edges, they are in the -<g or in the ^g relationship. From 
the above discussion it follows a sufficient condition: 

Property I4. Upward planar embedded st-digraphs are such that for each pair 

I I U U 

Cl, 62 of edges either e\ <g 62 or 62 <g ei or e\ <g or 62 Ag e-i- 
A complete characterization is given in the following theorem. 

Theorem 5 . Let G he an upward planar embedded digraph. For each pair ei, 62 

I I U 

of edges of G it holds that either 6i ^g 62, or 62 Ag ei, or ei <g 02, or 

U 

62 <G 6i, if ond only if G is regular. 

Proof. First, we prove the sufficiency. Namely, we prove that if G is regular then 

i i 

for each pair ei, 62 of edges of G it holds that either ei <g 62, or 62 Ag oi, or 

U U 

6l Ag 62 , or 62 Ag 6i. 

If G is regular then, by Property El and Theorem El it follows that there 
exists a unique planar embedded sf-digraph G' enclosing G and constructed by 
adding a complete saturator to G. The sufficiency is immediately implied by 
Property ^ 

Suppose now, for a contradiction, that there exists a non-regular upward 
planar embedded digraph G such that between any pair of edges of G either the 

I U 

^G or the ^G relationship is defined. Let f he & non-regular face of G, let u be 
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a sink switch of / labeled L, and let v and w be sink switches labeled S that 
can saturate u (the proof is symmetric for the case that u, v, and w are source 

i 

switches). Let ei and 62 be the two edges of / incident on v such that e\ 62; 

i 

let 63 and 64 be the two edges of / incident on w such that 63 -<g 64. Finally, let 
65 and Cq be the two edges of / incident on u. Since both v and w can saturate 
u, consider two different planar embedded st-digraphs that include G: G' has 
the saturating edge (u,v), while G" has the saturating edge (u,w). 

II 

In G , we have that ci -<g’ 65 <g’ by Property □ Since for all pairs of 

I U 

edges of G either the or the ^g relationship is defined and since there exists 
an st-digraph (constructed with a complete saturator) including G for which 

i i 

65 ^G' 62, we can conclude that 65 is to the left of 62 also for G, i.e. 65 ^g 62- 

i i 

In G we have that 63 -<g» 65 64- With analogous reasoning as above, 

i 

we conclude that 63 ^g 65. 

I U I 

Now, four cases are possible: Either 62 ^g 63 or 63 ^g 62, or 63 -<g 62, or 

U 

62 ^G 63. We show a contradiction for the first two cases. The proof for the 
other cases is symmetric. 

i i 

Namely, if 62 ^g 63, since we have shown that 65 ^g 62, by the transitivity 

i i 

property (PropertjEj) it follows 65 <g 63. But we should also have 63 <g 65, 

u u u 

a contradiction. If, in turn, 63 -<g 62, in G we have that 65 <g" \u,w) <g” 

63 ^G" 62 (PropertyEJ ■ By the transitivity property lPror)ert\ ll ill it follows 

U ... . . 

that in G 65 -<g" 62 which implies that also in G it should be 65 ^g 62. But, 
i 

we should also have 65 ^g 62, a contradiction because of Property El 

4 Upward Planarity Checking 

Let T be a connected polygonal-line drawing of a digraph G with n vertices and 
bends. An upward planarity checker of T receives as input the set of vertices and 
bends of F represented as pairs of integer coordinates, the set of oriented edges 
of r, and the embedding of T, i.e. the circular ordering of the edges incident on 
each vertex of F. 

Our upward planarity checker executes three tests in sequence. If a test fails, 
then the checker rejects F, otherwise it executes the test that follows in the 
sequence. At the end of the procedure, either a certificate for F is provided or a 
message that rejects F providing evidence of the property that is not respected 
by F. The tests performed by the checker are listed below. 

Embedding-Test: Verify whether the given embedding is planar. This is equiv- 
alent to verifying whether there exists a drawing F' of G that preserves the 
given embedding and such that no two edges of F' cross. The bimodality of 
the given embedding is also verified. 
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Upwardness-Test: Verify 'whether F is an upward drawing. 
Non-Crossing-Test: Verify whether any two edges of F cross. 

The Embedding-Test can be executed in 0{n) time with the techniques 
described in 0. Since no geometric test is performed, then the Embedding Test 
does not affect the overall degree of the checker. 

The Upwardness-Test can be executed in 0{n) time by visiting F with a 
standard visiting procedure. The geometric test involves an immediate compar- 
ison of the y-coordinates of the end vertices of the edges. This requires degree 1. 

In jS] it is shown that a straight-line undirected drawing whose induced em- 
bedding is planar does not have any edge crossings if and only if all faces in the 
drawing are simple polygons. In the same paper, an efficient algorithms is pre- 
sented for checking convex planar drawings of undirected graphs. Unfortunately, 
neither the faces of an upward drawing are in general convex, nor they belong to 
classes of polygons for which the simplicity test can be easily realized (see also 
Section^). Hence, an efficient (linear-time) realization of the Non-Crossing- Test 
based on the polygon simplicity check for all faces of F would imply either us- 
ing the fairly complex triangulation algorithm by Chazelle jSj or developing an 
ad-hoc strategy. 

We show an optimal degree strategy for the Non-Crossing-Test. The strat- 
egy can be applied to all upward drawings of digraphs with a regular embedding. 
As already pointed out in the previous sections, several existing drawing algo- 
rithms for upward planar digraphs compute drawings that our checker is able 
to verify (see Theorem . The strategy exploits the relationship between the 
geometry of F and the topological properties of the represented digraph G. Our 
Non-Crossing-Test consists of three steps. 

1. We check if T is a drawing of a regular upward planar embedded digraph. 
This can be done by traversing the faces of F. During the traversal of a 
face /: (i) the labeling of / is computed; (ii) the upward-consistency of / is 
verified; (iii) the regularity of / is verified. 

2. We construct the including planar st-digraph of G with its unique (see The- 
orem 0 complete saturator. Let G' be such including st-digraph. Based on 
Theorem El we use G' to define a total left-to-right ordering in the set of 
maximal non-intersecting paths of F . 

3. We explore F by visiting one-path-at-time from left to right. To do this, we 
follow the ordering induced by G' . Namely, we start by visiting the edges of 
F that belong to the leftmost path tti of G' from s to t. A new path tt of G' 
is visited (and the edges of F that it contains) only after all paths composed 
by edges that are “to the left” of the edges of tt have been already visited. 
Basically, we follow an “ear-decomposition” of G' . By the theory developed 

in the previous section, this guarantees that for each edge e' that has been 

i 

already visited in F and for each non-saturating edge e oi tt, either e e 

u “ / 

or e e or e ^g e . 

We maintain a rightmost boundary FI of the drawing. At the first step, 77 
is the portion of F that represents the non-saturating edges of tt\. At the 
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generic step, the portion of F that represents the non-saturating edges of a 
new path tt is compared with 77. If an intersection occurs, then we stop and 
report the intersection. Else, we compute the new rightmost boundary by 
merging 77 and tt as follows: for each pair of points p in tt and P in n such 
that y{p) = y{P), P is deleted from 77 and is replaced by p. 

We are now ready to analyze the efficiency of our Non-Crossing-Test. Re- 
garding Step 1, traversing the faces of P can be done in 0{n) time. Also, in 
order to compute the labeling of a face /, for each switch of / it must be checked 
whether the angle inside / is reflex or not. This can be done with degree 1 by a 
simple comparison of input coordinates. Regarding Step 2, the construction of 
G' can be done in 0{n) time by exploiting the technique shown in the proof of 
Lemma El Also, computing G' does not require the execution of any geometric 
tests and thus it does not affect the overall degree of the Non-Crossing-Test. 
Regarding Step 3, the following lemma provides the geometric foundation to 
analyze its efficiency. 

Lemma 4. Checking whether the non-saturating edges of tt intersect the right- 
most boundary 77 can he done in 0{k) time and degree 2. The parameter k is 
equal to the number of vertices and bends in tt plus the number of vertices and 
bends in 77 whose y-coordinates are in the y-interval spanned by tt. 

Proof. We follow 77 and tt from top to bottom with a dove-tail strategy driven 
by the p-coordinates. At each step a which-side test is executed to determine 
whether a vertex or bend of tt is to the right of the corresponding segment of 77. 
The rightmost boundary 77 is represented as follows. A segment r of 77 is always 
a subsegment of an edge e of 7^. Thus, instead of explicitly storing the endpoints 
of r, we represent r by means of the endpoints of e plus the p-interval spanned 
by r. This is done to avoid the explicit computation of the x-coordinates of the 
endpoints of r that would affect the overall degree. 

By exploiting such implicit representation of 77, each which-side test cor- 
responds to evaluating the sign of a determinant that defines a multivariate 
polynomial of degree 2 and such tha all elements of the determinant are either 
constant values or the coordinates of the vertices and bends of P (primitive vari- 
ables). Since the primitive variables have degree 1 each which-side tests can be 
executed with degree 2. 

The above discussion. Lemma 0 and Theorem E imply the following. 

Theorem 6. Let P be a polygonal line drawing with n vertices and bends. There 
exists a checker that verifies whether P is an upward planar drawing of a digraph 
with a regular embedding that runs in 0(n) time and has optimal degree 2. 

5 Extensions and Open Problems 

Our optimal degree checker can be easily extended to verify quasi-upward planar 
drawings P|. Namely, the following theorem can be proved. 
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Theorem 7. Let F he a polygonal line drawing with n vertices and bends. There 
exists a checker that verifies whether F is a quasi-upward planar drawing of 
a digraph with a regular embedding that runs in 0(n) time and has optimal 
degree 2. 

Several checking problems remain open in graph drawing. Consider that all 
graph drawing algorithms guarantee certain geometric properties for the draw- 
ings they produce. Such properties are usually called “graphic standards” or 
“drawing conventions” . Some of them appear to be easy to check, while others 
like checking proximity drawings seem to be much harder. For example, no al- 
gorithm is known to efficiently check whether a drawing is a Gabriel drawing 0 
or a Relative Neighborhood Drawing [ZH. 
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Abstract. We present a method for constructing orthogonal drawings 
of graphs of maximum degree six in three dimensions. Such a method 
is based on generating the final drawing through a sequence of steps, 
starting from a “degenerate” drawing. At each step the drawing “splits” 
into two pieces and finds a structure more similar to its final version. Also, 
we test the effectiveness of our approach by performing an experimental 
comparison with several existing algorithms. 



1 Introduction 

Both for its theoretical appeal and for the high number of potential applica- 
tions, research in 3D graph drawing is attracting an increasing attention. The 
interest of the researchers has been mainly devoted to straight-line drawings and 
orthogonal drawings. 

Concerning straight-line drawings, many different approaches can be found 
in the literature. For example, the method in [Z| is based on carefully exploiting 
the “momentum curve” to guarantee no edge crossings and a volume 4n^, where 
n is the number of vertices of the graph to be drawn. The same paper presents 
another example of algorithm which constructs drawings without edge crossings 
of planar graphs with degree at most 4. It is based on folding a 2-dimensional 
orthogonal grid drawing of area h x v into a straight-line drawing with volume 
h X V. Force directed approaches have been exploited to devise the algorithms 
in 0ElinilElE3iEIE3- Also, the research on this type of drawings stimulated 
the investigation on theoretical bounds. Examples of bounds on the volume of a 
straight-line drawing can be found in P,E1E|. Further, special types of straight- 
line drawings have been studied in |2l 1 1 .41 D E] (visibility representations) and 
in (proximity drawings). 

Concerning orthogonal drawings, all the algorithms guarantee no intersection 
between edges and most of the results apply mainly to graphs with maximum 
vertex degree six. Biedl |2| shows a linear time algorithm (in what follows we call 
it Slice) that draws in O(n^) volume with at most 14 bends per edge. Eades, 

* Research supported in part by the ESPRIT LTR Project no. 20244 - ALCOM-IT 
and by the CNR Project “Geometria Computazionale Robusta con Applicazioni alia 
Grafica ed al CAD.” 
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Stirk, and Whitesides m propose a )-time algorithm based on augment- 

ing the graph to an Eulerian graph and on applying a variation of an algorithm 
by Kolmogorov and Bardzin m- The algorithm (we call it Kolmogorov) draws 
in volume with at most 16 bends per edge. Eades, Symvonis, and White- 

sides m propose two algorithms. Both work in time and are based on 

augmenting the graph to a 6-regular graph and on a coloring technique. A first 
algorithm (we call it Compact) draws in volume with at most 7 bends 

per edge while a second algorithm (we call it Three-Bends) draws in at most 
27n^ volume with at most 3 bends per edge. Papakostas and Tollis present 
a linear time algorithm (we call it Interactive) that draws in at most 4.66n^ 
volume with at most 3 bends per edge. The algorithm can be extended to draw 
graphs with vertices of arbitrary degree. Finally, Wood m presents an algo- 
rithm for maximum degree 5 graphs that draws in 0{rt') volume with at most 2 
bends per edge. Although the results presented in the above papers are interest- 
ing and deep, the research in this field suffers, in our opinion, the lack of general 
methodologies. 

In this paper we deal with the problem of constructing orthogonal drawings 
in three dimensions. We experiment several existing algorithms to test their prac- 
tical applicability. Further, we propose new techniques that have a good average 
behaviour. Our main target are graphs with number of vertices in the range 
10-100 that are crucial in several applications. The results presented in this pa- 
per can be summarized as follows. We present a new method for constructing 
orthogonal drawings of graphs of maximum degree six in three dimensions with- 
out intersections between edges. It can be considered more as a general strategy 
rather than as a specific algorithm. The approach is based on generating the 
final drawing through a sequence of steps, starting from a “degenerate” draw- 
ing; at each step the drawing “splits” into two pieces and finds a structure more 
similar to its final version. We devise an example of algorithm developed accord- 
ing to the above method, called Reduce-Forks. We perform an experimental 
comparison of Compact, Interactive, Kolmogorov, Reduce-Forks, Slice, and 
Three-Bends against a large test suite of graphs with at most 100 vertices. 
We measure the computation time and three important readability parameters: 
volume, average edge length, and average number of bends along edges. Our 
experiments show that no algorithm can be considered “the best” with respect 
to all the parameters. Concerning Reduce-Forks, we can say that it has a good 
effectiveness for graphs in the range 5-30 and, among the algorithms that have a 
reasonable number of bends along the edges (interactive, Reduce-Forks, and 
Three-Bends), Reduce-Forks is the one that has the best behaviour in terms of 
edge length and volume. This is obtained at the expenses of an efficiency that is 
much worse than the other algorithms. However, the CPU time do not seem to 
be a critical issue for the size of graphs in the interval. 

The paper is organized as follows. In Section 0 we present our approach 
and in Section 0 we show its feasibility. In Section 0 we describe Algorithm 
Reduce-Forks. In Section 0 we present the results of the experimental compar- 
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The interested reader will find at our Web site a cgi program that allows 
to use the experimented algorithms, and the test suite used in the experiments 
(www . dia . uniromaS . it /^patr igna/ 3dcube) . 

2 A Strategy for Constructing 3D Orthogonal Drawings 

An orthogonal drawing of a graph is such that all the edges are chains of segments 
parallel to the axes. A grid drawing is such that all vertices and bends have 
integer coordinates. A 01-drawing is an orthogonal grid drawing such that each 
edge has either length 0 or length 1 and vertices may overlap. A 0- drawing is 
a trivial 01-drawing such that each edge has length 0 and all vertices have the 
same coordinates. A 1-drawing is a 01-drawing such that all edges have length 
1 and vertices have distinct coordinates. (See Fig. HI) Observe that while all 
graphs have a 01-drawing, only some admit a 1-drawing. 




Fig. 1. A 1-drawing of a graph with ten vertices. 



Let G be a graph. A subdivision G\ of G is a graph obtained from G by 
replacing some edge of G with a path. A subdivision G 2 of G\ is a subdivision of 
G. There always exists a subdivision of G that admits a 1-drawing. We partition 
the vertices of G\ into vertices that belong also to G ( original vertices of G) and 
vertices that belong only to Gi {dummy vertices). 

A dummy path of Gi is a path consisting only of dummy vertices but, possibly, 
at the endpoints (that can be original vertices). A planar path of an orthogonal 
drawing of Gi is a maximal path whose vertices are on the same plane. A planar 
dummy path is self-intersecting if it has two distinct vertices with the same 
coordinates. 

We propose a method for constructing orthogonal grid drawings with all 
vertices at distinct coordinates and without intersections between edges (except 
at the common endpoints) . The drawing process consists of a sequence of steps 
such that each step maps a 01-drawing of a graph G into a 01-drawing of a 
subdivision of G. We start with a 0-drawing of G and at the last step we get a 
1-drawing of a subdivision Gi of G. Hence, an orthogonal grid drawing of G is 
obtained by replacing each path u,v of Gi, corresponding to an edge (u,u) of 
G, with an orthogonal polygonal line connecting u and v. 

The general strategy is as follows. Let Go be a graph, we consider several 
subsequent subdivisions of Gq. In each subdivision new dummy vertices are 
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introduced. In each subdivision we call original the original vertices of Go and 
dummy all the other vertices. We construct an orthogonal grid drawing F of 
Go in four steps. Vertex Scattering: Construct a 01-drawing of a subdivision 
Gi of Go such that all the original vertices have different coordinates and all 
planar dummy paths are not self-intersecting (we call it scattered 01-drawing). 
After this step dummy vertices may still overlap both with dummy and with 
original vertices. Direction Distribution: Construct a scattered 01-drawing 
of a subdivision G 2 of G\ such that, for each vertex u, v and all its adjacent 
vertices have different coordinates (we call it direction- consistent 01-drawing). 
In other words, after this step the edges incident on v “leave” v with different 
directions. Observe that this is true both in the case v is original and in the case v 
is dummy. Vertex-Edge Overlap Removal: Construct a direction-consistent 
01-drawing of a subdivision G3 of G 2 such that for each original vertex v, no 
dummy vertex has the same coordinates of v (we call it vertex-edge-consistent 
01-drawing). After this step the original vertices do not “collide” with other 
vertices. Observe that dummy vertices having the same coordinates may still 
exist. Crossing Removal: Construct a 1-drawing of a subdivision G4 of G3 (all 
the vertices, both original and dummy, have different coordinates) . Observe that 
r is easily obtained from the drawing of G4. 

Each step is performed by repeatedly applying the same simple primitive 
operation called split. Informally, such operation “cuts” the entire graph with 
a plane perpendicular to one of the axes. The vertices laying on the “cutting” 
plane are split into two subsets that are “pushed” into two adjacent planes. 

Given a direction d we denote by —d its opposite direction. A split parameter 
is a 4-tuple (d^P^cj)^ p), where d is a direction and P is a plane perpendicular 
to d. Function (j) maps each vertex laying on P to a boolean. Function p maps 
each edge (u,u) laying on P such that (j){u) ^ (j){v) and such that u and v 
have different coordinates to a boolean. Given a split parameter {d,P,(j),p), a 
split{d, P, (j), p) performs as follows (see Fig. El). (1) We move of one unit in the d 
direction all vertices in the open half-space determined by P and d. Such vertices 
are “pushed” towards d. (2) We move of one unit in the d direction each vertex 
u on P with (j){u) = true. (3) For each edge (u,v) that after the above steps 
has length greater than one, we substitute (u, v) with the new edges (u, w) and 
(u>, v), where w is a new dummy vertex. Vertex w is placed as follows. (3. a) If the 
function p(u, v) is not defined, then vertex w is simply put in the middle point 
of the segment u,v. (3.b) If the function p{u,v) is defined (suppose, wlog, that 
4>{u) = true and (p{v) = false), then two cases are possible. If p{u,v) = true, 
then w is put at distance I in the d direction from u. If p{u,v) = false, then w 
is put at distance I in the —d direction from v. 

Observe that a split operation applied to a 01-drawing of a graph G con- 
structs a 01-drawing of a subdivision of G. Also, although split is a simple prim- 
itive, it has several degrees of freedom that can lead to very different drawing 
algorithms. Further, by applying split in a “random” way there is no guarantee 
that the process converges to a 1-drawing. 
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Fig. 2. An example of split: (a) before the split and (b) after the split. Vertices 
with (j) = true {4> = false) are black (light grey). Edges with p = true {p = 
false) are labelled t (f). The little cubes are dummy vertices inserted by the 
split. 



3 Feasibility of the Approach 

Property 1. Let T be a scattered 01-drawing of a graph. Each edge of T incident 
to a dummy vertex has length 1. 



Property 2. Let T be a 01-drawing of a graph obtained by a sequence of split 
operations from a 0-drawing of another graph. Each edge of P incident to a 
dummy vertex has length 1 . 

Proof. Dummy vertices are created by split operations. In such operations they 
are placed at distance 1 from their neighbors. 



Property 3. Let Iq be a 0-drawing of a graph Gq. There exists a finite sequence 
of split operations that, starting from Ig, constructs a scattered 01-drawing of 
a subdivision of Gg. 

Proof. Trivial. All the splits can be performed with planes perpendicular to the 
same axis. Each split separates one original vertex from the others. Note that 
all the obtained dummy paths are drawn as straight lines and hence are not 
self-intersecting and that all vertices lie on the same line. 

Let u be a vertex. We call the six directions isothetic wrt the axes around u 
access directions of u. The access direction of u determined by traversing edge 
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(u, v) from u to is used by (u, v). An access direction of u that is not used by 
any of its incident edges is a free direction of u. 

Given a direction d isothetic to one of the axes and a vertex v, we denote 
Pd^v the plane through v perpendicular to d. 

Theorem 1. Let Pi be a scattered 01-drawing of a graph Gi, subdivision of a 
graph Gq. There exists a finite sequence of split operations that, starting from 
Pi constructs a direction- consistent 01-drawing of a subdivision ofGi. 

Proof. We consider one by one each vertex u with edges (u, v) and (u, w) that 
use the same access direction d of u. Since Pi is a scattered 01-drawing, at least 
one of V and w (say v) is dummy. Also, by Property [Owe have that all edges 
incident to u use a direction of u. Two cases are possible. Case 1: at least one 
direction d' of the free directions of u is orthogonal to d; see Fig. Ela. Case 2: 
direction —d is the only free direction of u; see Fig. |3b. 



Case 1: We perform split{d' , Pd' ,u, 4>, p) as follows. We set (j>{v) = true, all 
the other vertices of Pd',u have (j) = false. Also, p{u,v) = true, all the other 
edges in the domain of p have p = false. After split, edge (u,v) uses direction 
d' of u. The usage of the other access directions of u is unchanged. Also, all the 
other vertices still use the same access directions as before the split with the 
exception, possibly, of v (that is dummy). 

Case 2: Let d” be a non-free direction of u different from d. We perform 
the same split operation as the one of Case 1, using direction d" instead of d' . 
After split, edge (u,v) uses direction d" of u. At this point, since at least one 
direction of the free directions of u is orthogonal to d", we can apply the same 
split strategy of Case 1. 

Finally, we observe that the above split operations preserve the properties of 
the scattered 01-drawings. 




d 



(a) 



(b) 



Fig. 3. Cases in the proof of Theorem Cl 
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We define a simplier version of split{d, P, cj>, p), called trivial split {d, P), where 
4> is identically false for all vertices of the cutting plane, and, as a consequence, 
the domain of p is empty. 

Theorem 2. Let P^ be a direction- consistent 01-drawing of a graph G2, sub- 
division of a graph Gq. There exists a finite sequence of split operations that, 
starting from P2, constructs a vertex-edge-consistent 01-drawing of a subdivision 

0fG2. 

Proof. Consider one by one each original vertex u of Go such that there exists 
a dummy vertex v with the same coordinates of u. Let {v' , v) and (y, v") be the 
incident edges of v. By Property^ v, v' and v” have different coordinates. 

Let d' and d” be the directions of v used by {v' , v) and {v, v”), respectively. We 
perform trivial split {d' , Pd\v) and trivial split {d" , Pd>\v)- After such operations 
vertex v is guaranteed to be adjacent to dummy vertices w' and w" created by 
the performed splits. Two cases are possible: either d' = —d" or not. In the first 
case we define d'” as any direction orthogonal to d'; in the second case we define 
d'” as any direction among d' , d” , and the two directions orthogonal to d' and d" . 
We perform split{d'” , Pd'" ^v, p) as follows. We set 4>{v) = true, all the other 
vertices of Pd'" ,v have (f = false. All the edges in the domain of p have p = true. 
Note that now u and v have different coordinates, that each split preserves the 
properties of direction-consistent 01-drawings, and that each operation does not 
generate new vertex-edge overlaps. 

Two distinct planar paths pi and p2 on the same plane intersect if there exist 
two vertices one of pi and the other of p2 with the same coordinates. 

Theorem 3. Let P^ be a vertex- edge- consistent 01-drawing of a graph G3, sub- 
division of a graph Gq. There exists a finite sequence of split operations that, 
starting from P^, constructs a 1-drawing of a subdivision of G3. 

Proof. Since P^ is vertex-edge-consistent, all vertices have distinct coordinates 
but dummy vertices, that may still overlap. We consider the number x of th® 
pairs of intersecting planar dummy paths of Ps. Fig. 21 shows that x can be 
greater than one even with just two vertices with the same coordinates. If y = 0, 
then T3 is already a 1-drawing of G3. Otherwise, we repeatedly select a pair 
of intersecting planar dummy paths pi and p2 and “remove” their intersection, 
decreasing the value of %. Such removal is performed as follows. 

Let ui and v\ be the endpoints of pi. If u\ (ui) is an original vertex we per- 
form trivial split{di,Pdi,ui) {trivialsplit{di, Pd^^^m)), where di is the direction 
determined by entering pi from ui (ui). The value of x stays unchanged. We 
denote by ri (si) the dummy vertex possibly introduced by the trivialsplit. 

Let xi and X2 be two vertices, one of pi and the other of p2, with the same 
coordinates; xi and X2 are dummy. Let d a free direction of X2 such that —d is 
a free direction of X\. Since xi and X2 have both degree 2, direction d always 
exists. We perform split{d,Pd,xn4>, p), by setting <f>{v) = true for each v G pi 
and v ri, Si {false otherwise). All the edges in the domain of p have p = true. 
The proof is easily completed by showing the decrease of x after the split. 



94 



Giuseppe Di Battista, Maurizio Patrignani, and Francesco Vargiu 




Fig. 4. Two dummy vertices with the same coordinates originating 3 pairs of 
intersecting planar dummy paths. 



We have shown that split is a powerful tool in performing the steps of the 
method presented in Section E] Namely, each step of Vertex scattering, Direc- 
tion distribution. Vertex-edge overlap removal, and Crossing removal can be 
performed by a finite sequence of splits. 



4 The Reduce-Forks Algorithm 

An edge {u, v) is cut by split{d, P, 4>, p) if u and v were on the same P-plane 
before the split and are on different planes parallel to P after the split. A pair 
of adjacent edges cut by a split is a fork. 

Algorithm Reduce-Forks follows the strategy described in Sections El and El 
However, the steps of the approach are refined into a set of heuristics that can be 
summarized as follows. Vertex Scattering: We repeatedly apply the following 
strategy. We select two original vertices u and v of Go with the same coordinates. 
We consider the set of split operations that separate u from v and perform the 
one with “a few” forks. The number of forks is kept small since, intuitively, each 
fork will require at least one bend to be removed in the subsequent Direction 
distribution step. Direction Distribution: For each original vertex u of Gq 
with edges {u.,v) and {u,w) such that v and w have the same coordinates: (1) 
We compute all the planar dummy paths containing {u,v) and {u,w). (2) We 
determine all the split operations that separate such paths and that separate 
V from w. (3) We weight such splits according to the number of bends they 
introduce and to the number rid of vertices that become direction-consistent 
after the split. We have that 1 < < 2. (4) We select and apply the split with 

minimum weight. Vertex-Edge Overlap Removal: For each original vertex 
u of Go such that v has the same coordinates as u: (1) We compute all the 
planar dummy paths containing v. (2) We determine all the split operations that 
separate such paths from u. (3) We weight such splits according to the number 
of bends they introduce and to the number of crossings introduced by the split. 
(4) We select and apply the split with minimum weight. Crossing Removal: 
For each pair of dummy vertices u and v having the same coordinates: (1) We 
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compute all the planar dummy paths containing u or v. (2) We determine all 
the split operations that separate such paths and u from v. (3) We weight such 
splits according to the number of bends they introduce. (4) We select and apply 
the split with minimum weight. 

Concerning the Vertex scattering step, observe that a split with no forks is a 
matching cut. Unfortunately, the problem of finding a matching cut in a graph 
is NP-complete (see hence a heuristic solution is needed. A simple and 

efficient heuristic for finding a split with a few forks is described below. 

Let G be a graph with adjacent vertices u and v. We color black and red the 
vertices of G in the two sides of the split. Each step of the heuristic colors one 
vertex. At a certain step a vertex can be black, red or free (uncolored). At the 
beginning u is black, v is red, and all the other vertices are free. 

Colored vertices adjacent to free vertices are active vertices. Black (Red) 
vertices adjacent to red (black) vertices are boundary vertices. See Fig. O Each 
step works as follows. (1) If a boundary active black (red) vertex, say x, exists, 
then we color black (red) one free vertex y adjacent to x. This is done to prevent 
a fork between {x,y) and {x,w), where u> is a red (black) vertex. (2) Else, if an 
active black (red) vertex, say x, exists, then we color black (red) one free vertex 
y adjacent to x. This is done for not cutting edge (a;, y). (3) Else, we color black 
or red (random) a (random) free vertex. 




Fig. 5. Red, black, and free vertices in the Vertex scattering heuristic of Algo- 
rithm Reduce-Forks. 



It is easy to implement the above heuristic to run in linear time and space 
(a graph with at most six edges per vertex has a linear number of edges). 



5 Experimental Results 

The experiments have been performed on a Sun Sparc station Ultra- 1 by using 
SDCube [ 23 . All the algorithms have been implemented in C-|— F. The test suite 
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was composed by 1900 randomly generated graphs having from 5 to 100 vertices, 
20 graphs for each value of vertex cardinality. All graphs were connected, with 
maximum vertex degree 6, without multi-edges and self-loops. The density was 
relatively high: the number of edges was twice the number of vertices. 

We considered two families of quality measures. For the efficiency we relied on 
the time performance (cpu seconds); for the readability we measured the average 
number of bends along the edges, the average edge length, and the volume of 
the minimum enclosing box with sides isothetic to the axes. 

We compared algorithms Compact, Interactive, Kolmogorov, Reduce- 
Forks, Slice, and Three-Bends. Fig. El and Q illustrate the results of the com- 
parison. 

The comparison shows that no algorithm can be considered “the best”. 
Namely, some algorithms are more effective in the average number of bends 
(interactive, Reduce-Forks, and Three-Bends) while other algorithms per- 
form better with respect to the volume (Compact and Slice) or to the edge 
length (Compact, Interactive, and Reduce-Forks). In particular: (i) The av- 
erage number of bends (see Fig. El-b) is comparable for Interactive, Reduce- 
Forks, and Three-Bends, since it remains for all of them under the value of 3 
bends per edge, while it is higher for Compact and Slice, and it definitely too 
much high for Kolmogorov. Furthermore, Reduce-Forks performs better than 
the others for graphs with number of vertices in the range 5-30. Interactive 
performs better in the range 30-100. Another issue concerns the results of the 
experiments vs. the theoretical analysis. About Kolmogorov the literature shows 
an upper bound of 16 bends per edge HH while our experiments obtain about 
19. This inconsistency might show a little “flaw” in the theoretical analysis (or, 
of course, in our implementation). Further, about Compact the experiments show 
that the average case is much better than the worst case m ( ii) Concerning 
the average edge length (see Fig.0-a), Reduce-Forks performs better for graphs 
up to 50 vertices, whereas Compact is better from 50 to 100; Interactive is a 
bit worse, while the other algorithms form a separate group with a much lower 
level of performance, (iii) The values of volume occupation (see Fig. Hb) show 
that Compact and Slice have the best performance for graphs bigger than 30 
vertices, while Reduce-Forks performs better for smaller graphs. Examples of 
the drawings constructed by the algorithms are shown in Fig. 0 

As overall considerations we can say that Reduce-Forks is the most effective 
algorithm for graphs in the range 5-30. Also, among the algorithms that have a 
reasonable number of bends along the edges (Interactive, Reduce-Forks, and 
Three-Bends), Reduce-Forks is the one that has the best behaviour in terms 
of edge length and volume. This is obtained at the expenses of an efficiency 
that is much worse than the other algorithms. However, the CPU time do not 
seem to be a critical issue for the size of graphs in the interval. In fact, even for 
Reduce-Forks the CPU time never overcomes the sole of 150 seconds, that is 
still a reasonable time for most of the applications. 
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Fig. 6. Comparison of Algorithms Compact, Interactive, Kolmogorov, 
Reduce-Forks, Slice, and Three-Bends with respect to time performance (a) 
and average number of bends along edges (b) . 
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(b) 

Fig. 7. Comparison of Algorithms Compact, Interactive, Kolmogorov, 
Reduce-Forks, Slice, and Three-Bends with respect to average edge length 
(a) and volume occupation (b). 
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6 Conclusions and Open Problems 

In this paper we presented a new method for constructing orthogonal drawings 
in three dimensions of graphs of maximum degree 6, and tested the effectiveness 
of our approach by performing an experimental comparison with several existing 
algorithms. The presented techniques are easily extensible to obtain drawings of 
graphs of arbitrary degree with the following strategy. The Vertex scattering 
step remains unchanged. In the Direction distribution step for vertices of degree 
greater than six, we first saturate the six available directions and then we evenly 
distribute the remaining edges. The Vertex-edge overlap removal step remains 
unchanged. In the Crossing removal step we distinguish between crossings that 
are “needed” because of the overlay between edges that is unavoidable because 
of the high degree and the crossings that can be removed. For the latter type of 
crossings we apply the techniques presented in Section 0 

Several problems are opened by this work. (1) Devise new algorithms and 
heuristics (alternative to Reduce-Forks) within the described paradigm. (2) Ex- 
plore the trade-off, put in evidence by the experiments, between number of bends 
and volume. (3) Measure the impact of bend-stretching (or possibly other post- 
processing techniques) on the performances of the different algorithms. (4) De- 
vise new quality parameters to better study the human perception of “nice draw- 
ing” in three dimensions. 
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Abstract. We define the geometric thickness of a graph to be the small- 
est number of layers such that we can draw the graph in the plane with 
straight-line edges and assign each edge to a layer so that no two edges 
on the same layer cross. The geometric thickness lies between two previ- 
ously studied quantities, the (graph-theoretical) thickness and the book 
thickness. We investigate the geometric thickness of the family of com- 
plete graphs, {Kn}. We show that the geometric thickness of Kn lies 
between [(n/5.646) + 0.342} and [n/4] , and we give exact values of the 
geometric thickness of K„ for n < 12 and n G {15, 16}. 



1 Introduction 

Suppose we wish to display a nonplanar graph on a color terminal in a way that 
minimizes the apparent complexity to a user viewing the graph. One possible 
approach would be to use straight-line edges, color each edge, and require that 
two intersecting edges have distinct colors. A natural question then arises: for a 
given graph, what is the minimum number of colors required? 

Or suppose we wish to print a circuit onto a circuit board, using uninsulated 
wires, so that if two wires cross, they must be on different layers, and that we 
wish to minimize the number of layers required. If we allow each wire to bend 
arbitrarily, this problem has been studied previously; indeed, it reduces to the 
graph-theoretical thickness of a graph, defined below. However, suppose that we 
wish to further reduce the complexity of the layout by restricting the number of 
bends in each wire. In particular, if we do not allow any bends, then the question 
becomes: for a given circuit, what is the minimum number of layers required to 
print the circuit using straight-line wires? 

These two problems motivate the subject of this paper, namely the geometric 
thickness of a graph. We define dg{G), the geometric thickness of a graph G, to 
be the smallest value of k such that we can assign planar point locations to the 
vertices of G, represent each edge of G as a line segment, and assign each edge 
to one of k layers so that no two edges on the same layer cross. This corresponds 
to the notion of “real linear thickness” introduced by Kainen cni. 

A related notion is that of (graph-theoretical) thickness of a graph, 9{G), 
which has been studied extensively DElliHElIIIl and has been defined as the 
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S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 102-^^21 1998. 
(c) Springer- Verlag Berlin Heidelberg 1998 



Geometric Thickness of Complete Graphs 103 



minimum number of planar graphs into which a graph can be decomposed. The 
key difference between geometric thickness and graph-theoretical thickness is 
that geometric thickness requires that the vertex placements be consistent at all 
layers and that straight-line edges be used, whereas graph-theoretical thickness 
imposes no consistency requirement between layers. 

Alternatively, the graph-theoretical thickness can be defined as the mini- 
mum number of planar layers required to embed a graph such that the vertex 
placements agree on all layers but the edges can be arbitrary curves 1101 . The 
equivalence of the two definitions follows from the observation that, given any 
planar embedding of a graph, the vertex locations can be reassigned arbitrarily 
in the plane without altering the topology of the planar embedding provided we 
are allowed to bend the edges at will mg. This observation is easily verified by 
induction, moving one vertex at a time. 

The (graph-theoretical) thickness is now known for all complete graphs P 
Eiisiiiain!, and is given by the following formula: 



0{K^) = 



1 , 

2 , 

3, 



1 < n < 4 
5 < n < 8 
9 < n < 10 
n > 10 



( 1 . 1 ) 



Another notion related to geometric thickness is the book thickness of a graph 
G, bt{G), defined as follows p. A book with k pages or a k-book, is a line L (called 
the spine) in 3-space together with k distinct half-planes (called pages) having 
L as their common boundary. A k-book embedding of G is an embedding of G in 
a fc-book such that each vertex is on the spine, each edge either lies entirely in 
the spine or is a curve lying in a single page, and no two edges intersect except 
at their endpoints. The book thickness of G is then the smallest k such that G 
has a fc-book embedding. 

It is not hard to see that the book thickness of a graph is equivalent to a 
restricted version of the geometric thickness where the vertices are required to 
form the vertices of a convex n-gon. This is essentially Lemma 2.1, page 321 of 
p. It follows that 0{G) < 0g{G) < bt(G). It is shown in P that bt{Kn) = \n/2\ . 

In this paper, we focus on the geometric thickness of complete graphs. In 
Section 0we provide an upper bound, 6g{Kn) < |"u/4]. In Section 0we provide 



3-\/7 

2 



(n-k I) 



— I 5.646 I ■ 



a lower bound. In particular, we show that 9g{Kn) > 

This follows from a more precise expression which gives a slightly better lower 
bound for certain values of n. 

These lower and upper bounds do not match in general. The smallest values 
for which they do not match are n G {13, 14, 15}. For these values of n, the upper 
bound on 9g{Kn) from Section El is 4, and the lower bound from Section 0 is 3. 
In Section m we resolve one of these three cases by showing that 9g{Ki^) = 4. 
For n = 16 the two bounds match again, but they are distinct for all larger n. 
Section 0 contains a table of the lower and upper bounds on 9g{Kn) established 
in this paper for n < 100 and lists a few open problems. 
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Fig. 1. Construction for embedding K 2 k with geometric thickness of fc/2, illus- 
trated for k = 10. (a) The inner ring, (b) The outer ring. The circle in the center 
of (b) represents the inner ring shown in (a). 



2 Upper Bounds 

Theorem 1. 6g{Kn) < [n/4]. 

Proof. Assume that n is a multiple of 4, and let n = 2k (so, in particular, k is 
even). We show that n vertices can be arranged in two rings of k vertices each, 
an outer ring and an inner ring, so that Kn can be embedded using only fc/2 
layers and with no edges on the same Ijper crossing. 

The vertices of the inner ring are arranged to form a regular fc-gon. For each 
pair of diametrically opposite vertices P and Q, consider the zigzag path as il- 
lustrated by the thicker lines in Figure Gl(a). This path has exactly one diagonal 
connecting diametrically opposite points (namely, the diagonal connecting the 
two dark points in the figure.) Note that the union of these zigzag paths, taken 
over all fc/2 pairs of diametrically opposite vertices, contains all edges con- 
necting vertices on the inner ring. Note also that for each choice of diametrically 
opposite vertices, parallel rays can be drawn through each vertex, in two op- 
posite directions, so that none of the rays crosses any edge of the zigzag path. 
These rays are also illustrated in Figure [D) a). 

By continuity, if the infinite endpoints of a collection of parallel rays (e.g., 
the family of rays pointing “upwards” in Figure (D^a)) are replaced by a suit- 
ably chosen common endpoint (so that the rays become segments), the common 
endpoint can be chosen so that none of the segments cross any of the edges of 
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the zigzag path. We do this for each collection of parallel rays, thus forming an 
outer ring of k vertices. This can be done in such a way that the vertices on 
the outer ring also form a regular k-gon. By further stretching the outer ring if 
necessary, and by moving the inner ring slightly, the figure can be perturbed so 
that none of the diagonals of the polygon comprising the outer ring intersect the 
polygon comprising the inner ring. The outer ring constructed in this fashion is 
illustrated in Figure [H'b'l. 

Once the 2k vertices have been placed as described above, the edges of the 
complete graph can be decomposed into k/2 layers. Each layer consists of: 

1. A zigzag path through the outer ring, as shown in Figure ^(b). 

2. All edges connecting V and V' to vertices of the inner ring, where V and 
V are the (unique) pair of diametrically opposite points joined by an edge 
in the zigzag path through the outer ring. (These edges are shown as edges 
connecting the circle with V and V in Figure m, and as arrows in Fig- 
ure ^a)). 

3. The zigzag path through the inner ring that does not intersect any of the 
edges connecting V and Y' with inner-ring vertices. (These are the heavier 
lines in Figure G](a).) 

It is straightforward to verify that this is indeed a decomposition of the edges 
of Kn into kj2 = n/4 layers. ■ 



3 Lower Bounds 



Theorem 2. For all n > 1, 



(^g{Kn) > 



max 

l<x<nj1 



(;)-43-y 

3n — 2x — 7 



In particular, for n > 12, 



0g{Kn) > 



3- V7 
2 



n-k 0.342 



> 



n 

5.646 



0.342 



(3.1) 



(3.2) 



Proof. We first prove a slightly less precise bound, namely 

(3.3) 

For graph G and vertex set X, let G[X] denote the subgraph of G induced by X. 
Let S be any planar point set, and let Ti, . . . be a set of planar triangulations 
such that every segment connecting two points in S is an edge of at least one of 
the Ti. Find two parallel lines that cut S into three subsets A, B, and C (with 
B the middle set), with |A| = |C| = x, where a; is a value to be chosen later. For 
any Ti, the subgraph Ti[A\ is connected, because any line joining two vertices 
of A can be retracted onto a path through Ti[A\ by moving it away from the 
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line separating A from B. Similarly, Ti[C] is connected, and hence each of the 
subgraphs Ti[A\ and Ti[C] has at least x — 1 edges. 

By Euler’s formula, each Ti has at most 3n — 6 edges, so the number of edges 
of Ti not belonging to Ti[A\ U Ti[C] is at most 3n — 6 — 2{x — 1) = 3n — 2x — 4. 
Hence 

^ < 2(^2) +/c(3n-2a;-4). (3.4) 




Solving for k, we have 



k > 



G)-4i) 

3n — 2x — 4’ 



and hence 



— 2x^ 
6n — Ax 



0 ( 1 ). 



(3.5) 



If a; = cn for some constant c, then the fraction in (LLLI) is of the form n(l — 
2c^)/(6 — 4c). This is maximized when c = (3 — 'Jl')j2. Substituting the value 
a; = (3 — 'Jl)nj2 into (Id., 61) yields (I.S.:ill . 

To obtain the sharper conclusion of the theorem, observe that by choosing 
the direction of the two parallel lines appropriately, we can force at least one 
point of the convex hull of S to lie in B. Hence, of the edges of Ti that do not 
belong to Ti [H] U Ti \C] , at least three are on the convex hull. If we do not count 
these three edges, then each Ti has at most 3n — 2x — 7 edges not belonging to 
Ti [A] U Ti [C] , and we can strengthen (13.41) to 





+ k{3n — 2x — 7), 



or 



k > 



G) - 20 - 3 

3n — 2x — 7 



(3.6) 



Since (ft. 61) holds for any x, (Id. II) follows. 

To prove (lO . let f{x) be the expression on the right-hand side of 13.61) . 
Consider the inequality f(x) > xq, where xq is a constant to be specified later. 
After cross-multiplication, this inequality becomes 



— — 7 — 2x')xo > 0. 



(3.7) 



The expression in the left-hand side of (E3 represents an inverted parabola in 
X. If we let X = Xq, we obtain 



9 

Tl^ 71 

Xq -b (8 - 3n)xo -bY---3>0, (3.8) 

and if we let a; = Xq + I we obtain the same inequality. Now, consider xq of the 
form An + B — e. Choose A and B so that if e = 0, the terms involving and 
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n vanish in llh.sll . This gives the values A = (3 — and B = -\/7(23/14) — 4. 

Substituting xq = An + B — e with these values of A and B into (EH), we obtain 

y?- e-n + (e^ - -3/28) > 0. (3.9) 

For e = 0.0045, (E3) will be true when n > 12. Therefore, for all x G [xq, xq + 1], 
f{x) > xq, when e = 0.0045 and n > 12. In particular, f{\xo~\) > Xq. Since k is 
an integer, EH follows from H.S.tiB . ■ 

4 The Geometric Thickness of 

The lower bounds on geometric thickness provided by equation (tl.ll) of The- 
orem 0 are asymptotically larger than the lower bounds on graph-theoretical 
thickness provided by equation (EH, and they are in fact at least as large for 
all values of n > 12. However, they are not tight. In particular, we show that 
^9(^15) = 4, even though (13.11) only gives a lower bound of 3. 

Theorem 3. Ogl^Ki^) = 4. 

To prove this theorem, we first note that the upper bound, 9g{Ki^) < 4, 
follows immediately from Theorem ^ 

To prove the lower bound, assume that we are given a planar point set S, 
with [S'! = 15. We show that there cannot exist a set of three triangulations of 

5 that cover all (2^) = 105 line segments joining pairs of points in S. We use 
the following two facts: (1) A planar triangulation with n vertices and b convex 
hull vertices contains 3n — 3 — b edges; and (2) Any planar triangulation of a 
given point set necessarily contains all convex hull edges. There are several cases, 
depending on how many points of S lie on the convex hull. 

Case 1: 3 points on convex hull. Let the convex hull points be A, B and C. Let 
Ai (respectively, Bi, Ci) be the point furthest from edge BC (respectively AC, 
AB) within triangle ABC. Let A2 (respectively, B 2 , C2) be the point next fur- 
thest from edge BC (respectively AC, AB) within triangle ABC. 

Lemma 1. The edge AAi will appear in every triangulation of S. 

Proof. Orient triangle ABC so that edge BC is on the a;-axis and point A is 
above the x-axis. For an edge PQ to intersect AAi , at least one of P or Q must 
lie above the line parallel to BC that passes through Ai. But there is only one 
such point, namely A. ■ 

Lemma 2. At least one of the edges A1A2 or AA2 will appear in every trian- 
gulation of S . 

Proof. Orient triangle ABC so that edge BC is on the a;-axis and point A is 
above the a;-axis. For an edge PQ to intersect A1A2 or AA2, at least one of P or 
Q must lie above the line parallel to BC that passes through A2. There are only 
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two such points, A and Ai. Hence an edge intersecting A 1 A 2 must necessarily 
be AX and an edge intersecting AA 2 must necessarily be AiV, for some points 
X and V outside triangle AA 1 A 2 . Since edges AX and AiY both split triangle 
AA 1 A 2 , they intersect, so both edges cannot be present in a triangulation. It 
follows that either A 1 A 2 or AA 2 must be present. ■ 

Now let Z be the set of 12 edges consisting of the three convex hull edges 
and the nine edges ppi,pp 2 ,pip 2 (where p G {A, B, C}). Each triangulation of 
S contains 39 edges, and since any triangulation contains all three convex hull 
edges, it follows from Lemmas 0 and 0 that at least 9 edges of any triangulation 
must belong to Z. Hence a triangulation contains at most 30 edges not in Z. 
Thus three triangulations can contain at most 30 • 3 + 12 = 102 edges, and hence 
cannot contain all 105 edges joining pairs of points in S. 

Case 2: 4 points on convex hull. Let A,B,C,D be the four convex hull vertices. 
Assume triangle DAB has at least one point of S in its interior (if not, switch A 
and C). Let A\ be the point inside triangle DAB furthest from the line DB. By 
Lemma n the edge AAi must appear in every triangulation of S, as must the 4 
convex hull edges. Since any triangulation of S has 38 edges, three triangulations 
can account for at most 3 • 33 + 5 = 104 edges. 

Case 3: 5 or more points on convex hull. Let h be the number of points on the 
convex hull. A triangulation of S will have 42 — h edges, and all h hull edges must 
be in each triangulation. So the total number of edges in three triangulations is 
at most 3(42 — 2h) + h = 126 — 5h, which is at most 101 for h> 5. 

This completes the proof of Theorem 01 



5 Final Remarks 



In this paper we have defined the geometric thickness, 9g, of a graph, a measure 
of approximate planarity that we believe is a natural notion. We have established 
upper bounds and lower bounds on the geometric thickness of complete graphs. 
Tabled contains the upper and lower bounds on 9g{Kn) for n < 100. 

Many open questions remain about geometric thickness. Here we mention 
several. 

1. Find exact values for 9g{Kn) (i-C., remove the gap between upper and lower 
bounds in Table d). In particular, what are the values for K 13 and K 14 ? 

2. What is the smallest graph G for which 0g{G) > 0(G)? We note that the 
existence of a graph G such that 6g{G) > 9{G) (e.g., Ki^) establishes Con- 
jecture 2.4 of fTTTj . 

3. Is it true that 9g{G) = O (0(G)) for all graphs G? It follows from Theoremd 
that this is true for complete graphs. 

4. What is the complexity of computing 0g(G) for a given graph G? In par- 
ticular, is it NP-complete? (Computing 0(G) is known to be NP-complete 

El-) 
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Table 1. Upper and lower bounds on 6g{Kn) established in this paper. 



n 


LB 


UB 


1- 4 


1 


1 


00 


2 


2 


9-12 


3 


3 


13-14 


3 


4 


15-16 


4 


4 


17-20 


4 


5 


21-24 


5 


6 


25-26 


5 


7 


27-28 


6 


7 


29-31 


6 


8 


32 


7 


8 


33-36 


7 


9 


37 


7 


10 



n 


LB 


UB 


38-40 


8 


10 


41-43 


8 


11 


44 


9 


11 


45-48 


9 


12 


49-52 


10 


13 


53-54 


10 


14 


55-56 


11 


14 


57-60 


11 


15 


61-64 


12 


16 


65 


12 


17 


66-68 


13 


17 


69-71 


13 


18 


72 


14 


18 



n 


LB 


UB 


73-76 


14 


19 


77 


14 


20 


78-80 


15 


20 


81-82 


15 


21 


83-84 


16 


21 


85-88 


16 


22 


89-92 


17 


23 


93-94 


17 


24 


95-96 


18 


24 


97-99 


18 


25 


100 


19 


25 



Note: Upper bounds are from Theorem ^ The lower bounds for n > 12 are from 
Theorem|2| with the exception of the lower bound for n = 15 which is from Theorem|51 
Lower bounds for n < 12 are from ( 11 . 111 . 
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Abstract. We describe a new approach for cluster-based drawing of 
very large graphs, which obtains clusters by using binary space partition 
(BSP) trees. We also introduce a novel BSP-type decomposition, called 
the balanced aspect ratio (BAR) tree, which guarantees that the cells 
produced are convex and have bounded aspect ratios. In addition, the 
tree depth is O(logn), and its construction takes 0(n log n) time, where 
n is the number of points. We show that the BAR tree can be used to 
recursively divide a graph into subgraphs of roughly equal size, such that 
the drawing of each subgraph has a balanced aspect ratio. As a result, we 
obtain a representation of a graph as a collection of 0(log n) layers, where 
each succeeding layer represents the graph in an increasing level of detail. 
The overall running time of the algorithm is 0(n log n + m + Dq{G)), 
where n and m are the number of vertices and edges of the graph G, 
and Do(G) is the time it takes to obtain an initial embedding of G. 
In particular, if the graph is planar each layer is a graph drawn with 
straight lines and without crossings on the n x n grid and the running 
time reduces to 0(n log n). 



1 Introduction 

In the past decade hundreds of graph drawing algorithms have been developed 
(e.g., see 0), and research in methods for visually representing graphical in- 
formation is now a thriving area with several different emphases. One general 
emphasis in graph drawing research is directed at algorithms that display an 
entire graph, with each vertex and edge explicitly depicted. Such drawings have 
the advantage of showing the global structure of the graph. A disadvantage, 
however, is that they can be cluttered for drawings of large graphs, where de- 
tails are typically hard to discern. For example, such drawings are inappropriate 
for display on a computer screen any time the number of vertices is more than 
the number of pixels on the screen. For this reason, there is a growing empha- 
sis in graph drawing research on algorithms that do not draw an entire graph, 
but instead partially draw a graph, either by showing high-level structures and 
allowing users to “zoom in” on areas of interest, or by showing substructures of 
the graph and allowing users to “scroll” from one area of the graph to another. 
Such approaches would be more suitable, for instance, for displaying very large 

* This research partially supported by ARO under grant DAAH04-96- 1-0013. 
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graphs, such as significant portions of the world wide web graph, where every 
web page is a vertex and every hyper- link is an edge. 

A common technique used in the scrolling viewpoint context is the fish- eye 
view lEJIISlEIl, which shows an area of interest quite large and detailed (such 
as nodes representing a user’s web pages) and shows other areas successively 
smaller and in less detail (such as nodes representing a user’s department and 
organization web pages). Fish-eye view drawings allow for a user to understand 
the structure of a graph near a specific set of nodes, but they often do not display 
global structures. 

An alternate paradigm is to display global structure present in a graph by 
clustering smaller subgraphs and drawing these subgraphs as single nodes or 
filled-in regions. By grouping vertices together into clusters in this way we can 
recursively divide a given graph into layers of increasing detail, which can then be 
viewed in a top-down fashion or even in fish-eye view by following a single path 
in a cluster-based recursion tree. If clusters of a graph can be given as input along 
with the graph itself, then several authors give various algorithms for displaying 
these clusters in two or three dimensions |01 [3 101 [H3 ■ If, as will often be 

the case, clusters of a graph are not given a priori, then various heuristics can 
be applied for finding clusters, say, using properties such as connectivity, cluster 
size, geometric proximity, or statistical variation (d[Hll^. Once a clustering 
has been determined, we can generate the layers in a hierarchical drawing of 
the graph, with the layer depth (i.e., number of layers) being determined by 
the depth of the recursive clustering hierarchy. This approach allows the graph 
to be represented by a sequence of drawings of increasing detail. As illustrated 
by Fades and Feng [^, this hierarchical approach to drawing large graphs can 
be very effective. Thus, our interest in this paper is to further the study of 
methods for producing good graph clusterings that can be used for graph drawing 
purposes. 

We feel that a good clustering algorithm and its associated drawing method 
should come as close as possible to achieving the following goals: 

1. Balanced clustering: in each level of the hierarchy the size of the clusters 
should be about the same. 

2. Small cluster depth: there should be a small number of layers in the recursive 
decomposition. 

3. Convex cluster drawings: the drawing of each cluster should fit in a simple 
convex region, which we call the cluster region for that subgraph. 

4. Balanced aspect ratio: cluster regions should not be too “skinny” . 

5. Efficiency: computing the clustering and its associated drawing should not 
take too long. 

The goal of this paper is to study how well we can achieve these goals for 
very large graph drawings using clustering. Previous algorithms optimize one or 
more of the above criteria at the expense of some of the rest. Our goal is to try 
to optimize all of them. Our approach relies on creating the clusters using binary 
space partition (BSP) trees, defined by recursively cutting regions with straight 
lines. 
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1.1 BSP Tree Based Clustered Graph Drawing 

The main idea behind the use of a BSP tree to define clusters is very simple. 
Given a graph G = (V, E), where n = \V\ and m = \E\, we can use any existing 
method to embed it, provided that method places vertices at distinct points in 
the plane (e.g., see For example, if G is planar we can use any existing 

method for embedding G in the plane such that vertices are at grid points, and 
edges of the graph are straight lines that do not cross 0, 0 CHI I2D| • Once the 
graph drawing is defined, we build a binary space partition tree on the vertices 
of this drawing. Each node v in this tree corresponds to a convex region R of the 
plane, and associated with u is a line that separates R into two regions, each of 
which are associated with a child of v. Thus, any such BSP tree defined on the 
points corresponding to vertices of G naturally defines a hierarchical clustering 
of the nodes of G. Such a clustering could then be used, for example, with an 
algorithm like that of Eades and Feng 0, who present a technique for drawing 
a 3-dimensional representation of a clustered graph. 

The main problem with using BSP trees to define clusters for a graph drawing 
algorithm is that previous methods for constructing BSP trees do not give rise 
to clustered drawings that achieve the design goals listed above. For example, 
the standard k-d tree and its derivates (e.g., see El EDI), which use axis-parallel 
lines to recursively divide the number of points in a region in half, maintain 
every criteria but the balanced aspect ratio. Likewise, quad-trees and fair-split 
trees (e.g., see 0Eni), which always split by a line parallel to a coordinate 
axis to recursively divide the area of a region more or less in half, maintain 
balanced aspect ratio but can have a depth that is &{n). In our graph drawing 
application, aesthetics are extremely important, as “fat” regions appear rounder 
and a series of skinny regions can be distracting. But depth is also important, for 
a deep hierarchy of clusterings would be computationally expensive to traverse 
and would not provide very balanced clusters. The balanced box-decomposition 
tree of Arya et al [H E] has O(logn) depth and has regions with good aspect 
ratio, but it sacrifices convexity by introducing holes into the middle of regions, 
which makes this data structure less attractive for use in clustering for graph 
drawing applications. Indeed, to our knowledge, there is no previous BSP-type 
hierarchical decomposition tree that achieves all of the above design goals. 

1.2 The Balanced Aspect Ratio (BAR) Tree 

In this paper we present a new type of binary space partition tree that is bet- 
ter suited for the application of defining clusters in a large graph. Our data 
structure, which we call the balanced aspect ratio (BAR) tree, is a BSP-type 
of decomposition tree that has 0(log n) depth and creates convex regions with 
bounded aspect ratio (i.e., so-called “fat” regions). The construction of the BAR 
tree is very similar to that of a k-d tree, except for two important differences: 

1. The BAR tree allows for one additional cut direction: a 45°-angled cut. 

2. Rather than insisting that the number of points in a region be cut in half at 
every level, the BAR tree guarantees that the number of points is cut roughly 
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Fig. 1. The multi-level view of a graph with each cluster represented by a single node. 
Note the relationship between the cuts on the left and the clusters on the right. 



in half every two levels, which is something that does not seem possible to 
do with either a k-d tree or a quadtree (or even a hybrid of the two) while 
guaranteeing regions with bounded aspect ratios. 

In short, the BAR tree is an 0(log n)-depth BSP-type data structure that creates 
fat, convex regions. Thus, the BAR tree is “balanced” in two ways: on the one 
hand, clusters on the same level have roughly the same number of points, and, 
on the other hand, each cluster region has a bounded aspect ratio. 

We show that a BAR tree achieves this combined set of goals by proving 
the existence of a cut, which we call a two- cut. A two-cut might not reduce the 
point size by any amount but maintains balanced aspect ratio and ensures the 
existence of a subsequent cut, which we call a one-cut, that both maintains good 
aspect ratio and reduces the point size by at least two-thirds. In Section 0 we 
formally define one- and two-cuts and describe how to construct a BAR tree. 

1.3 Our Results for Cluster-Based Graph Drawing 

In Section 0 we show how to use the BAR tree in a cluster-based graph drawing 
algorithm. The Very Large Graph Drawing (VLGD) algorithm runs in 0{nlogn-\- 
m -\- Dq{G)) time, where n and m are the number of vertices and edges in the 
graph G and Dq{G) is the time to embed G. If the graph is planar, the algorithm 
introduces no edge crossings and the running time reduces to 0(n log n). 

The algorithm creates a hierarchical cluster representation of a graph, with 
balanced clusters at each layer and with cluster depth O(logn). Each cluster 
region has a balanced aspect ratio, guaranteed by the BAR tree data structure. 
In the actual display of the clustered graph we represent the clusters either with 
their convex hulls, with a larger region defined by the BSP tree, or simply with 
a single node (see Figure P). 
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create_clustered_graph(T, G) 
K 

for i = k downto 0 

obtain Gi from K 
shrink K 

add the edges of T 
return C 



Fig. 2. Creating the clustered graph. 



2 Using a BSP Tree for Cluster Drawing 

Let G = (y, E) be the graph that we want to draw, where |U | = n and \E\ = m. 
The goal of our VLGD algorithm is to produce a 3-dimensional representation 
C of the embedded graph G given a BSP tree T. Similar to |S| we define the 
clustered graph G = (G,T) to be the graph G, and the BSP tree T, such that 
the vertices of G coincide with the leaves of T. An internal node of T represents 
a cluster, which consists of all the vertices in its subtree. All the nodes of T at a 
given depth i represent the clusters of that level. A view at level i, Gi = (Vi, Ei), 
consists of the nodes of depth i in T and a set of representative edges. The edge 
(m, v) € Ei if there is an edge between a and b in G, where a is in the subtree 
of u and b is in the subtree of v. In addition, each node u € T has an associated 
region, corresponding to the partition given by T. 

We create the graphs Hi in a bottom-up fashion, starting with and going 
all the way up to Hq, where k = depth(T). Define the combinatorial graph 
K = (y{K), E{K)), where initially V(K) = {u G T : depth(u) = fc} and 
E{K) = E{G). Notice that K is well defined since the leaves of T are exactly 
the vertices of G. 

At each new level i we perform a shrinking of K. Suppose u,v G V{K), and 
parent (m) = parent (u). We replace the pair by their parent and remove the 
edge (u, v) if it exists. We also remove any multiple edges that this operation 
may have created and maintain for each surviving edge a pointer to the original 
edge in G. Thus a shrinking of K consists of all such operations, necessary to 
transform K into a representation of G at one higher level in the tree T. 

At each level Gi is a subgraph of G with certain edges removed. In addition, 
the z-coordinate of a vertex u G is equal to i, that is, all the vertices in 
Gi are embedded in the plane given hy z = i. To obtain Gi from Gi+i, for 
i = 0,...,fc — 1, we use the combinatorial graph K from level z -I- 1. Initially 
Ei = Ei^i. We then perform a shrinking of K and while removing an edge from 
K we remove its associated edge from Ei. 

This algorithm (see Figure EJ runs in 0(n • depth(T) -|- m) time. Depending 
on the type of BSP tree used, we can maintain most but never all of the desired 
properties. For example, if T is a k-d tree the cluster regions do not have balanced 
aspect ratios. We next describe how to construct a BSP tree which satisfies all 
of our goal criteria. 
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3 The BAR Tree 

Let us now discuss in detail the definition of our particular BSP-type decom- 
position tree, the BAR tree, and its construction. We begin with some gen- 
eral definitions. Let a line have a canonical slope if it forms an angle with the 
x-axis that is 0°, 90°, or 45° (referred to as the x,y,z directions). Define a 
canonical cut to be a cut with canonical slope, and a canonical hexagon to be a 
hexagon whose sides have canonical slope and possibly degenerate length. The 
aspect ratio of a canonical hexagon i? is ar(i?) = max(diELmi(i?))/ min(dicmij(i?)) 
Vt, j G {x, y, z\, where diEmia;(i?) is the distance between the a;-slopes of R region 
in the Lm metric, and similarly for the others. Let the maximum aspect ratio be 
a constant, say a = 6. A region R has balanced aspect ratio if ar(i?) < a and a 
unbalanced aspect ratio otherwise. 

For simplicity of arguments and notations, we will use the Loo metric be- 
cause all other metrics allow for “skinnier” rectangles to be produced. Using 
this metric, the length ||2|| = ||z||oo of a diagonal cut (with slope 45°) is simply 
the vertical (or horizontal) distance between its endpoints. Also, the distance 
between two diagonal lines is one half of the vertical (or horizontal) distance 
between them. Throughout the paper we often refer to regions with balanced 
and unbalanced aspect ratios as fat and skinny regions, respectively. 

Suppose we are given a point set S in the plane, |5| = n, and an initially 
square region R containing S. We now introduce the BAR tree data structure, 
which divides R into cells such that the following properties are guaranteed: 

— Every cell is convex. 

— Every cell has balanced aspect ratio. 

— Every cell has no more than a constant number of points. 

— The tree has 0{n) nodes. 

~ The depth of the tree is O(logn). 

The structure is straightforward and reminiscent of the original k-d tree. 
Recall that in a k-d tree, every node i in the tree represents a cell region Ri and 
an axis-parallel cut partitioning Ri into two subregions. The leaves of the tree 
are cells with a constant number of points. In general, each cut divides the region 
into two roughly equal halves, and thus the tree has O(logn) depth and uses 
0{n) space. However, if the vast majority of the points are concentrated close 
to any particular corner of the region, no constant number of axis-parallel cuts 
can effectively reduce the size of the point set and maintain good aspect ratio. 
This is a serious problem with many applications and with ours in particular. 
As a result, an extensive amount of research has been dedicated to improving 
and analyzing the average case performance of k-d trees and its derivatives often 
concentrating on trying to maintain some form of balanced aspect ratio. 

3.1 Constructing the BAR Tree 

In this section we show how to construct a BAR tree T using the vertices of an 
embedded graph G, an aspect ratio parameter a and a balance parameter /?. 
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create_BAR_tree(_R, a, /3) 

if an (a, /3)-baianced one-cut I, exists in R 

{Ri,R2)^1{R) 

else let s be an Q-balanced two-cut in R 
(i?i, R 2 ) <— s(R) 

let s' be an {a, /l)-balanced one-cnt in Ri 
{Ri,R3)^s'(Ri) 
recurse on Ri 



Fig. 3. Creating the BAR tree. 



Before we proceed any further with proving the existence of the necessary one- 
and two-cuts, we must first formally define these cuts. 

Definition 1. Let i? be a convex region with aspect ratio a or less and n points 
inside it and let (3 be the balance parameter. Define a one-cut to be a canonical 
cut which divides R into two subregions R\ and R 2 such that for i = 1,2: 

1. Ri contains no more than (3n points. 

2. Ri has aspect ratio a or less. 

We say that a region R is one-cuttable if it has balanced aspect ratio and 
there exists a one-cut for R. 



Definition 2. Let i? be a convex region with aspect ratio a or less and n points 
inside it, and let [3 be the balance parameter. Define a two-cut to be a canonical 
cut, s, which divides R into two subregions i?i and i ?2 such that: 

1. Ri contains no less than j3n points. 

2. i ?2 has aspect ratio a or less. 

3. Ri is one-cuttable (call that cut s'). 

In other words, the sequence of two cuts, s and s', results in three regions 
each with balanced aspect ratio and each containing no more than (3n points. A 
region R is two-cuttahle if it has balanced aspect raion and there exists a two-cut 
for R. 

We are now ready to give the pseudo-code for the construction of a BAR 
tree (see Figure El- Here we use the notation {R\, R 2 ) ^ ^(R) as a shorthand for 
cutting the region R with a cut I which results in subregions i?i and R 2 ■ Given a 
and (3, a cut is a-balanced if the subregions produced have aspect ratio less than 
or equal to a. Similarly, a cut is /3-balanced if the subregions produced have less 
than or equal to a /3 fraction of the points in the original region. Finaly, a cut is 
(a, /3)-balanced if satisfies both conditions. 
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Fig. 4. The labels on the sides of a general canonical hexagon and the maximizing cuts 
in the respective directions. 



3.2 Two-Cut Existence Theorem 

Here we consider the correctness and performance our algorithm for constructing 
a BAR tree. In the creation of the BAR tree we rely on the existence of either 
a one-cut or a two-cut. Recall that we only make canonical cuts (cuts with x- 
angle 0°, 90°, 45°, referred to as the x, y, z directional cuts of “canonical” slope). 
Thus the regions we create are canonical hexagons. We state the following trivial 
lemmas and postpone the detailed proofs for the final version of the paper. 

Lemma 1. Given a canonical hexagon R with balanced aspect ratio, consider 
a line I with canonical slope and coincident to a (degenerate) side of R. Let’s 
sweep I upwards (inside R) until the region above I, P, has either maximum 
aspect ratio or is empty. Call the region P maximized in the direction of I and 
the defining cut, I, a maximal-cut. If P is not empty and we continue sweeping, 
the region above I will have an unbalanced aspect ratio until it becomes empty. 

Corollary 1 If the region P is maximized in the direction of I, ar(P) = a and 
min(dicmia;(P), diajnj,(P), diajiiz(P)) = diajn/(P). 

Lemma 2. If there exists a continuum of canonical cuts completely covering a 
region, R, in which each cut always yields two subregions of balanced aspect ratio, 
there exists a one-cut in R, for (3 > 2/3. 

Here we are extending a well-known geometric result which states that we 
can find a bisector in any direction. In our case, we have a continuum of cuts 
(in no more than three directions) which “cover” the entire region, while always 
maintaining balanced aspect ratios. 

What are some specific regions guaranteed to be one-cuttable? We describe 
two such regions that are needed for guaranteeing the existence of two-cuts. 
Before we look at some specific regions, let us define the sides of the regions 
we will be dealing with. Let x\ x’’’ , y\ y’’ , z^ , z” be the sides of a general canoni- 
cal hexagon. Furthermore, let be the cuts that maximize the 

regions in the respective directions as shown in Figure 0 
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Fig. 5. CIT and CRT regions. 



Lemma 3. Canonical isoceles trapezoidal (DIT ) regions and canonical right- 
angle trapezoidal ("CRT^ regions are one-cuttahle regions (see Figure\^. 

Proof. Let di = dicmij(i?). We analyze each region individually. 

(a) CIT regions: In a CIT with aspect ratio a define i5 = ||z’'|| = da; — ||a;’'||. 

From the shape of the region, we know that dx = dy and a;'’ = y'" . Recall 
that in the Loo metric, dz = /2 = y^/2. Since the object has aspect ratio a, 

we have dx/dz = a and 2dx/x'" = cr, and so 2dx = a{dx — d), which implies 
that dx = a6/{a — 2). We can sweep from and from y^ covering the entire 
region while maintaining balanced aspect ratio for cr > 4. Thus, such an isoceles 
trapezoid has a one-cut. 

(b) CRT regions: In a CRT with aspect ratio a, we have dx = exdy and so 
x^ajia — 1) = x'^ = dx = ady, which implies dy = x’' /{a — 1). We can sweep 
from y^ and from T’’ in the same manner for a > 4. Thus, such a right-angled 
trapezoid has a one-cut. 

It is easy to construct examples where a region R cannot be divided into 
roughly equal portions by a simple single cut (be it axis-alligned or diagonal- 
type cut). However, the following theorem shows that using a two-cut followed 
by a one-cut we can in fact divide the cells into convex subregions with the 
desired properties (balanced aspect ratio and less than a constant fraction of the 
region’s points). We omit the proof due to lack of space. 

Theorem 1. (Two-Cut Existence Theorem) A eanonical hexagon R which 
does not have a one-cut must have a two-cut. 



Theorem 2. Civen a point set S in the plane, we can construct a BAR tree 
representing a decomposition of the plane into regions in O(nlogn) time. 

Proof. Notice that a one-cut or a two-cut in any of the three canonical directions 
can be found in 0{n) time and that the depth of the tree is O(logn). 
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VLGD(G,a,/3) 
embed (G) 

T ^ create_BAR_tree(G, Q, /3) 

H ^ create_clustered_graph(T, G) 
display(ff) 



Fig. 6. Main algorithm. 



4 Using a BAR Tree for Cluster Based Drawing 

Let G = {V,E) be the graph that we want to draw. Once we obtain the em- 
bedding of G, using whatever algorithm is most appropriate for the graph, we 
associate with the graph the smallest bounding square, R, which we call G’s 
cluster region. Using the embedding and its cluster region, we create the BAR 
tree T, as described above. Each node u € T maintains region(it), cluster(u), 
and depth(u). Here cluster(u) is the subgraph of G which is properly con- 
tained in region(u). Recall that the depth of the tree T is k = O(logn). In 
our application of the tree structure to cluster-based graph drawing, we want 
every leaf to be at the same depth. Therefore, we propagate any leaf not at the 
maximum depth down the tree until the desired depth is reached. This is merely 
conceptual and does not require any additional storage space or change in the 
tree structure. 

Using the tree T, we create the clustered graph G, which consists of k layers. 
Each layer is an embedded subgraph of G along with the regions and clusters 
obtained from T. The layers are connected with vertical edges which are simply 
the edges in T. The other inputs to VLGD are the aspect ratio parameter a and 
the balance parameter, /3. Here, a determines the maximal aspect ratio of a 
cluster region in G, and /3 determines the cluster balance, the ratio of a cluster’s 
size to its parent’s. For a summary of the operations, see FigureEl 

Lemma 4. A call to VLGD(G, a, /?) for a = 6, f3 = 2/3 results in 2/3-balanced 
clustering with aspect ratio less than or equal to 6 and cluster depth O(logn). 

Proof. By construction, the clusters are /3-balanced and the cluster depth is 
equivalent to the depth of T. For a > 6 and /3 > 2/3 the depth is Oilogi/p n). 

Theorem 3. For ct > 6, /3 > 2/3, algorithm VLGD creates a 2/3-balanced clus- 
tered graph G in 0(n log n -|- m -|- Dq{G)) time. 

Proof. The proof follows directly from the construction of the algorithm and 
previous statements about the running time of each component. 

Once we obtain the clustered graph G, we can display it as a 3-dimensional 
multi-layer graph representing each cluster by either the the convex hull of its 
vertices or by its associated region in the BAR tree. Along with the clustered 
graph G we can display a particular cluster with more details. Thus we provide 
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Fig. 7. Graph G with an inherently large cut. Any cut L which maintains an /3-balance 
between the clusters, where 1/2 < /3 < 1, cuts 0(n) edges. 



the global structure using the clustered graph and the local detail using the 
individual clusters. 

4.1 Planar Graphs 

When the graph G is planar, we are able to show a few special properties of our 
clustered drawings. 

Theorem 4. If G is planar, for a > 6, f) > 2/3, algorithm VLGD creates a 2/3- 
halanced clustered graph C in 0{nlogn) time. Moreover, C is embedded with 
straight lines and no crossings on the n x n x k grid, where k = O(logn). 

Proof. We begin with a planar grid embedding with straight-line edges P1IEIE3 
and then the original layer, Gk, is planar. Since each successive layer is a proper 
subgraph of the previous layer, it too must be planar and drawn without edge 
crossings. 

It is possible to have an edge cross a region that is does not belong to. 
Moreover, it is possible to have an edge cross the convex hull of a cluster that it 
does not belong to. If we represent a cluster by the convex hulls of its connected 
components, however, there will be no such crossings. Thus, if we could guarantee 
that each cluster is connected or has a small number of connected components, 
the display of the graph can be improved even further. Alternatively, we can 
redefine the clusters at each level to be the connect components of vertices 
inside each cluster region of the BAR tree. With this definition of clusters we 
could then use the algorithm of Fades and Feng |E| to produce a new clustered 
embedding of the planar graph so as to have no edge or region crossings. 

4.2 Extensions 

Throughout this paper we do not discuss the cut sizes produced by our algorithm, 
that is the number of edges intersected by a cut line in the BAR tree. In some 
applications it is important that the number of such edges cut be as small as 
possible. There exist graphs, however, that do not allow for “nice” cuts of small 
size. Consider the star graph G on Figure Q Any cut, which maintains a /3- 
balance between the two subgraphs it produces, intersects 0{n) edges. If the 
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Fig. 8. The graph in part (a) has no /3-balanced line cut of size better than 0(n) but it 
does have a cycle cut (the dotted circle) of size 0(1). We can transform the graph in (a) 
to the graph in (b) by taking one of the faces crossed by the cycle as the outer face. Note 
that in (b) the cycle cut has become a line and its size is 0(1). 



balance parameter is [3 = 1/2, the cut contains edges. As this example 
shows, we cannot hope to guarantee cut sizes better than 0(n). Still, if the 
given graph has a small cut then we would like to find a small cut as well. 

Minimizing the cut size violates two of our five criteria, namely, speed and 
convexity. First of all, looking for the best /3-balanced cut is a computationally 
expensive operation, and while it can be done in polynomial time, it is not hard 
to see that it cannot be done in linear time. In addition, the best /3-balanced 
cut may not preserve the convex cluster drawing property that VLGD maintains, 
which may result in new edge crossings in our clustered graph. 

Our algorithm does not guarantee that it will find the optimum /3-balanced 
cut but we can modify the BAR tree construction so that we find locally optimal 
cuts. Here are some of the possible criteria that we can use in choosing among 
the potential cuts: minimize cut size, minimize connected components resulting 
from a given cut, minimize aspect ratio, maximize /3-balance. 

These criteria can also be combined in various ways to produce desired scoring 
methods. In finding such optimal cuts, it is important to note that a one-cut, 
if available, might not always be a better choice over a potential two-cut. Yet 
again, a two-cut that minimizes the cut size may have no subsequent one-cut 
that does not cut many more edges. Thus, it may be reasonable to go two levels 
in evaluating possible scores instead of choosing greedily. 

5 Conclusion and Open Problems 

In this paper we present a straightforward and efficient algorithm for displaying 
very large graphs. The VLGD algorithm optimizes cluster balance, cluster depth, 
aspect ratio and convexity. Our algorithm does not rely on any specific graph 
properties, although various properties can aide in performance, and produces 
the clustered graph in a very efficient 0(n log n -I- m -I- Dq(G)) time. 

The embedding of the cluster graph is determined in the very first step of 
our algorithm. Unfortunately, it is possible that the initial embedding is not the 
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best one. In fact, as shown on Figure 0 G may have a minimum /^-balanced cut 
of size 0{n) or 0(1), depending on the embedding. While it is still true that 
some graphs may always have cuts of size 0(n) (for example, the star graph. 
Figure 0 , we would like to minimize the cut whenever we can. It is an open 
question whether it is possible to determine the optimal embedding, one that 
yields the minimum /3-balanced cuts. 

Another open question is related to the separator theorems of Lipton and 
Tar j an and Miller m- Is it possible given a 2-connected planar graph G to 

always produce 0{^fdn) /3-balanced cuts, where d is its maximum degree, and n is 
the number of vertices? If so, can we find an embedding for the resulting clustered 
graph which preserves efficiency, cluster balance, cluster depth, convexity, and 
guarantees good aspect ratio and straight-line drawings without crossings? 
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Abstract. Orthogonal drawings of graphs are widely investigated in the 
literature and many algorithms have been presented to compute such 
drawings. Most of these algorithms lead to unpleasant drawings with 
many bends and a large area. We present methods how to improve the 
quality of given orthogonal drawings. Our algorithms try to simulate the 
thinking of a human spectator in order to achieve good results. We also 
give instructions how to implement the strategies in a way that a good 
runtime performance can be achieved. 



1 Introduction 



Among the various layout styles orthogonal drawings play a central role in graph 
drawing. There are many applications like ER-diagrams and workflow visualiza- 
tion where orthogonal drawings are required. The quality of these drawings is 
usually estimated by analyzing them with respect to some cost function like the 
amount of used area, the number of bends and various others. 

The two main categories of algorithms consist of 

— fast algorithms (almost linear time) which guarantee a good worst-case per- 
formance but usually produce drawings of poor quality. Examples for such 
approaches are C3HIIE1. 

— good algorithms which guarantee good quality, but have a slow performance 

If we look at this situation from a practical point of view we can easily state that 
these two classes are disjoint. In several papers, the approaches have been experi- 
mentally compared (see e.g. 0) and the good algorithms clearly outerperformed 
the fast algorithms with regard to the quality of the drawings. 

* Parts of this work were done during this author’s stay at the Universitat Tubingen. 
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Nevertheless, it is evident that not only the results of the fast algorithms but 
also of the good algorithms can be more or less drastically improved by man- 
ual modifications in most cases. In this paper, we present a framework for effi- 
cient local changes to improve the quality of any given orthogonal drawing. We 
demonstrate how the drawings produced by a fast algorithm or even by a good 
algorithm can be improved such that the result will be clearly better than the 
solution directly found by good approaches. After having done our automatic 
improvements, even an experienced layout person will have a hard time to find 
places for further manual improvements. 

Since the algorithm improves the quality of the drawing step by step, this process 
can be done interactively. The user can determine (by passing a parameter) how 
much time he likes to spend. The more time he is willing to wait, the better the 
quality of the resulting drawing. 

In the following we present four methods to save bends or area: Moving, Match- 
ing, Morphing and Merging. We demonstrate the power of these strategies by 
giving examples. 



2 The Model 



To construct orthogonal drawings, the drawing plane is subdivided by horizontal 
and vertical gridlines of unit spacing A. The vertices of the graph are represented 
by rectangles of size {wX — ^) x {hX — -|) for some positive integers w and h. 
They are placed such that the borders of the rectangles overlap the gridlines by 
j. Such drawings have the nice property that the distance between two vertices 
on gridpoints with unit distance is the same as the size of the smallest possible 
vertex. The intersections of the gridlines with the rectangle define the set of 
ports, namely the positions where the start (end) points of the incident edges 
may be placed. All segments of the edges are placed on gridlines. They may 
cross but not overlap, and also the rectangles representing the vertices must be 
disjoint. See Fig. Q for an example; the gridpoints are marked with a cross in 
this figure. 

Orthogonal drawings naturaly support the drawing of vertices of degree at most 
4; the strategy described above permits to handle vertices with a higher degree 
in the same natural way. One of the most appealing drawing models within 
this framework is the so-called bignode-model (see e.g. |7f d] ) . Here the number 
of vertical gridlines intersecting the rectangle for vertex v (roughly the width 
of v) is determined by the maximum of the number of incident edges at the 
top and bottom side of the rectangle. The height of v is restricted analogously. 
Thus every vertex is as large as it must be to allow a proper orthogonal edge 
distribution. Many previous approaches do not meet this criterion (e.g. GIOTTO 
m, visibility representations (e.g. mm)) and might lead to drawings with 
unnecessarily large vertices. 
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Fig. 1. Notation of the used model. 



3 The 4M- Algorithm 

3.1 Moving 

Moving is an operation that does not change the angles and bends of the drawing 
but the length of edge segments. Thus Moving cannot save bends, but it is a 
very powerful method to save area. 

The basic idea of Moving is illustrated in Fig. El 




Fig. 2. A simple example for Moving. 



The part of the drawing at the right side of the dashed line is moved by one unit, 
reducing the width of the drawing. This seems to be an easy operation, but Fig. 
0 shows that even for very small graphs the situation can be more complicated. 

The problem of formulating and finally implementing this method reduces to 
defining the possible course of the dashed line which we call moving-line. We 
perform Moving in two phases, a horizontal phase and a vertical phase. In the 
following we only describe the horizontal phase (the vertical phase can be for- 
mulated analogously). 

Let r be an orthogonal drawing and F' the (planar) drawing where bends and 
edge crossings are replaced by artificial vertices. Subdivide every face of F' into 
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Fig. 3. A more difficult example for the moving-line. 



horizontal stripes by adding artificial vertices and edges such that every stripe is 
a (not necessarily bounded) rectangle (see Fig. 0 for an example). The resulting 
drawing is called F” 0 . 




Fig. 4. An orthogonal drawing is partitioned into stripes. 



The purpose for generating the drawing F” is to avoid overlapping parts in the 
resulting drawing. 



Definition 1 (moving-line J) An area-saving moving-line J is a line that ful- 
fils the following conditions: 

a) J is directed and consists of horizontal and vertical pieces. 

b) J starts above of the topmost object of F" and ends below the bottommost 
object of F" . 

c) J does not intersect any vertical edge of F" . 

d) Every horizontal edge of F" that is intersected by a piece of J which is 
directed downward has a finite length larger than or equal to two. 



After finding a moving-line J the new (smaller) drawing can be constructed by 
decreasing the length of every edge of F" that is intersected by J in downward 
direction by one unit and by increasing by one unit the length of every edge of 
F" that is intersected by J in upward direction. Since J intersects the border of 
F" in downward direction the total area of the drawing decreases. Moving can 



^ All operations described in the rest of the paper work on F" . We will only describe 
horizontal operations. 
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be intuitively seen as separating the drawing in two parts (at the left side and at 
the right side of the moving-line) and then moving one part closer to the other 
one. 

Finding a moving-line can be implemented as finding a path in the planar dual 
graph (which is well-defined since F” is planar). We use dfs for that purpose. 

Note that the moving-line is not necessarily monotonous in y-direction. In general 
it is necessary to increase the length of some edges in order to shorten other ones 
(see Fig.Elfor an example). For practical applications however, we often only look 
for monotonous moving-lines; by that, we can perform several moving-operations 
simultaneously, which is more efficient than in the general case (see Section 0 . 

Remarks: Moving resembles the compaction techniques developed for VLSI lay- 
out canini. The three remaining operations (Matching, Morphing, Merging) 
are novel. The idea of dividing a drawing in two parts and modifying one part 
was first mentioned in uni, where one part of the drawing was turned by 90° in 
order to save bends. 



3.2 Matching 



The next operation called Matching is designed for saving bends. The idea is to 
save a bend b on an edge e = (u, v) by moving either u or u to the place of b (to 
match a bend with a vertex). See Fig. 0for an example. 




Fig. 5. Matching bend b with vertex v using a matching- line. 



We use the intuition given in the previous section. Let e be an edge incident to a 
vertex v. Moving v to the geometric place of a bend 6 on e can be performed by 
finding a line (analogously to the moving-line) that separates the drawing in two 
parts, V and b being in different parts. It is not necessary to move a part of the 
border of the drawing because we do not want to save area. Thus the new line 
(the matching-line) J is a closed simple line that does not necessarily traverse 
the outer face. 

Definition 2 (matching-line J) A bend-saving matching-line J is a line that 
fulfils the following conditions (let s be the segment of e between v and b): 
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a) J is directed and consists of horizontal and vertical pieces. 

b) J starts and ends in the face at the side of e where b has its 27Q° -angle and 
intersects the segment s 

c) J does not intersect any vertical edge of F" . 

d) Every horizontal edge of F" (besides s) that is intersected by a piece of J 
which is directed downward has a finite length larger than the length of s. 

See Fig. E|for an example. The new drawing can be constructed by shifting the 
whole part of the drawing F" ‘inside’ of the matching-line to the left such that 
V replaces b. 

3.3 Morphing 

Morphing is a procedure that saves bends by resizing vertices. The basic idea of 
Morphing is illustrated in FigEl 

1) Save a bend next to vertex v by expanding v such that it now covers the 
bend. 

2) Resize the vertex back to the smallest possible size. 




Fig. 6. The two main steps of the Morphing-algorithm. 



There are three conditions why Morphing can fail in the bignode-model: 

a) The resulting vertex intersects with another object of the graph. 

b) The resulting vertex does not fit the bignode-condition anymore (minimal 
possible size). Two problems must be distinguished here: 

bl) The vertex became wider. Thus the number of edges incident to its top 
respectively bottom side must be checked. 
b2) The number of edges incident to the vertex’ right side was decreased. 
Thus the height of the vertex must be checked. 

Fig.Q shows examples where Morphing is not possible because of either of the 
three above reasons. Note that in the middle and in the right drawing v is not 
a legal bignode. 
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Fig. 7. Three cases where Morphing is impossible. 



Let ti be a vertex in an orthogonal drawing F" with an incident edge e; let b 
be a bend on e and s the segment of e between v and b such that s has length 
one. Each of the three conditions can be seen as moving parts of the drawing: 
condition a) demands moving not the whole vertex, but moving the right border 
of V to the column of b. In Fig. El the right border of v was moved to the bend 
whereas the left border kept its original position. Condition bl) says that if the 
number of edges incident at the bottom side of v is smaller than the number 
of edges incident to v at its top side, then we must move the right border of v 
and a part of the lower border of v to the left in order to remove unused ports 
on both horizontal borders of v. Condition b2) says that if the number of edges 
incident to the left side of v is smaller than the number of edges incident to v 
at its right side, then we must move the lower border of v (and possibly a part 
of the left border, too) upward in order to remove unused ports at both vertical 
sides of V. 

Analogously to Moving and Matching we can describe these operations with a 
morphing-line. Since we want to change the shape of a vertex we regard the ver- 
tex in question as a face and allow the morphing-line to traverse that face (resp. 
vertex). By that some parts of the border of the vertex are extended or short- 
ened. The necessary conditions for the morphing-line can easily be developed 
analogously to Moving and Matching and are skipped here. 

Fig. 0 illustrates that the morphing- line consists of three parts, all of them 
traversing the vertex v. Rather than regarding each part as a separate line we 
link the three parts together and consider them as one line. That enables us to 
test all three conditions simultaneously. Note that an additional Moving at the 
end would even save two gridlines. 
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Remark: The inverse operation to Morphing was described in HH, where it 
was used to shrink vertices by introducing bends in order to get an orthogonal 
drawing out of a visibility representation. 



I I ~ ^ ' I 

^ ^ ' . r ~ 
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Fig. 8. The three parts of the morphing-line. 



3.4 Merging 

Moving tries to save area, Matching and Morphing are designed to save bends. 
Our last algorithm has the purpose of reduce the size of vertices. In mosts cases 
this also means to save area. The basic idea of this method is to merge locally 
two gridlines into one. Merging is a two-step procedure, Fig. 0 shows an example 
for a simple Merging. First we perform a Morphing in the opposite way: Vertex 
V is contracted by introducing a bend on the edge {v,w). This means that we 
have to find an inverse morphing-line to resize the vertex. In the second step 
this bend is matched to w. Thus the final drawing has as many bends as before, 
but we reduced the size of one vertex. Like for Morphing we can perform both 
steps simultaneously in one call of a pathfinding procedure by linking together 
the lines to a merging-line (the exact definition of this line can easily be derived 
from the results in the previous sections). 
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Fig. 9. An example for Merging. 



4 Implementation Issues and Time Complexity 

4.1 Worst Case Analysis 

After having presented the ideas behind our algorithms we now analyze their 
runtime behavior. 

Theorem 1 The total running time of a complete run of 4M is 0{n^). 

Proof (Sketch). 4M basically consists of three steps: 

1. Computing the subdivision into stripes which can be done in time 0(n-log n) 
by using a sweep-line method. 

2. Finding a line which takes linear time for performing a variant of dfs. 

3. Redrawing the graph which is obviously a linear-time procedure. 

But we do not have to recompute the data structure after each step from scratch. 
We maintain the stripes as balanced trees where the corresponding objects are 
stored. Inserting and deleting into this data structure can be done in logarithmic 
time, thus the time necessary before we can start to find the next line is very 
short. If we assume that our initial drawing only had a linear number of bends 
(most orthogonal drawing algorithms guarantee such a bound) we have a linear 
bound for the number of actions each of them requiring linear time. 



4.2 Fast and Good Variants of 4M 

As mentioned earlier the 4M-algorithm is designed for practical use rather than 
to establish new theoretical results. Running 4M without time limits leads to 
similar bounds of the running-time as the good algorithms themselves. But 4M 
can easily be implemented not to find quasi-optimal solutions, but still to find 
good solutions while guaranteeing a short running-time. The main observation is 
that almost all (matching-, morphing-, merging-) lines have a very short length 
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in practical examples. Thus looking for lines consisting of at most k segments 
and bounding A: by a constant (e.g. k = 10) leads to almost the same results as a 
complete run of 4M, but only needs constant time per operation (plus updating 
the data structure). Moreover the parameter k can be determined by the user. 

Using this strategy one matching-, morphing- or merging-step can be performed 
in constant time. We cannot use this technique for Moving because the moving- 
line has to traverse the complete drawing in one dimension; thus its length cannot 
be bounded by a constant. But we can be tricky here though: First we replace 
intermediate Movings (being necessary for example to move a bend closer to a 
vertex in order to perform Morphing which requires unit distance between vertex 
and bend) by local operations (that do not save space); these steps again can be 
performed in constant time. Further we restrict ourselves to do ‘real’ Movings 
at the very end of the algorithm and only compute monotonous lines. They can 
be computed simultaneously in a right-first dfs manner in linear time and we do 
not lose much quality, although being unable to find all possible moving- lines. 
Implementing all these ideas leads to algorithm Quick-4M: 



Theorem 2 Quick-4M can he implemented to run in 0{n • logn) time. 



Experiments show that the results of Quick-4M have almost the same quality as 
the drawings computed by 4M. 

The main drawback of 4M is that there is no natural order how to apply the op- 
erations. Indeed the order is very important: Applying one method can destroy 
the necessary conditions for other improvement steps. If we are willing to spend 
more time we can formulate the problem as a max-flow-problem in a correspond- 
ing network, with the purpose of finding the maximum possible progress instead 
of any local progress. By the augmenting paths technique we can ‘reverse’ some 
actions, and find another ‘order’ which gives better results. 

The runtime of this procedure is 0(n^ logn) which is not much worse than 
performing a complete 4M and we can expect to improve the quality of our 
drawings. 

Another way of improving the algorithm is to be smart in the choice of the 
vertex (resp. bend) that we are dealing with. So far we try any vertex to find a 
suitable candidate for the starting point of a line. If we are unlucky, the ‘good’ 
candidates come at the end. To avoid this situation we can start our search at 
all vertices simultaneously; as soon as one line J has been found (the shortest 
one), we block the region inside of J from being entered by other lines. This 
improves the efficiency of the algorithm. 

These approaches can only be hints how to improve the drawings further and 
require more research and experiments. 
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5 Conclusion 

We have presented four algorithms to modify a given orthogonal drawing. Table 
n shows how these operations can change the size of the vertices, the total area 
of the drawing and the number of bends. A ’+’-sign means an improvement with 
regard to the corresponding criterion, a ’-’-sign a possible worsening, an ’=’-sign 
no change. 



Table 1. How the methods can change the drawing. 





vertex 

size 


area 


bends 


Moving 


= 


-f 


= 


Matching 


-f = 


+ = 


-f 


Morphing 


-f = - 


+ = 


-f 


Merging 


-f 


-h 


= 



Even with the realization of the max-flow-network indicated in the previous 
section, it is still an interesting open problem to determine a good order how to 
perform the four methods. At least at the very end, a moving-operation should 
be performed because a drawing that still allows moving-steps often contradicts 
human users’ aesthetic feeling (see e.g. Fig. 0. 

The remaining question is how to compute an initial drawing for 4M. The best 
results can be obtained when there are not many bends on a single edge, whereas 
the total number of bends is not so important. Thus the staircase-algorithm 
(presented in |E|) is an ideal start for a drawing algorithm which uses 4M. For 
the staircase algorithm guarantees drawings with one bend per edge for planar 
graphs and the usual methods can be applied to extend it for nonplanar graphs. 

Fig. El and m show an impressive example for the power of 4M: First an initial 
drawing computed by the staircase algorithm is shown. Then the same graph 
after applying 4M to the result of the staircase algorithm can be seen and the 
same graph drawn by the provably ‘good’ algorithm Kandinsky 0. 4M clearly 
outerperforms the Kandinsky-result. 

Summarizing the results of this paper we state that (by combining 4M with 
the staircase-algorithm) we have for the first time a fast algorithm that com- 
putes orthogonal drawings pleasant for a human observer. The secret of 4M is 
that the algorithm tries to apply exactly the operations that a person would 
perform intuitively. Further investigation and research in this direction are cer- 
tainly necessary. For a more detailed description, practical results and a complete 
documentation of the implementation see [Dj. 
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Fig. 10. An orthogonal drawing produced by the staircase algorithm. 





Fig. 11. Orthogonal drawings of the graph of Fig. E] by the Kandinsky algo- 
rithm 

(left side) and produced by 4M starting with the drawing of Fig. (right side). 
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Abstract. In this paper, we present an object-oriented design and im- 
plementation of the core steps of the GIOTTO algorithm for orthogonal 
graph drawing. Our design is motivated by the goals of making the al- 
gorithm modular and extensible and of providing many reusable compo- 
nents of the algorithm. This work is part of the jdsl project aimed at 
creating a library of data structures and algorithms in Java. 

1 Introduction 

In the last few years, there has been an increasing interest in applications of 
software engineering concepts, such as object-oriented programming and design 
patterns, to the area of design and implementation of data structures and algo- 
rithms (see, e.g. P3HH]). 

Traditionally, algorithms have been implemented in a way that would max- 
imize their efficiency, which frequently meant sacrificing generality and exten- 
sibility. The area of algorithm engineering is concerned with finding ways to 
implement algorithms so that they are generic and extensible. One of the pro- 
posed concepts is that of an algorithmic pattern HE]. Similar to the design pat- 
terns 0 in software engineering, algorithmic patterns abstract and generalize the 
algorithms and data structures of common use. They allow the programmer to 
implement new algorithms by extending already existing ones, thus reducing the 
time spent coding and debugging. In many cases, algorithmic patterns abstract 
out the core of several similar algorithms, and the programmer only has to im- 
plement the extensions of the core that are needed for a specific algorithm. The 
object-oriented approach to algorithm implementation incurs some overhead due 
to the indirection, and yields implementations that are slower than the ad-hoc 
ones. For applications where fast running time is essential, the object-oriented 
approach is intended to be used for rapid prototyping of new algorithms, which, 
once fully designed, can be efficiently implemented using traditional methods. 

In this paper, we describe an object-oriented design and implementation of 
the core steps of GIOTTO drawing algorithm m, orthogonalization and com- 
paction, which construct a planar orthogonal drawing of an embedded planar 
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96-1-0013 and by the National Science Foundation under grants CCR-9732327 and 
CDA-9703080 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 138-^^21 1998- 
(c) Springer- Verlag Berlin Heidelberg 1998 



Algorithmic Patterns for Orthogonal Graph Drawing 



139 



graph with the minimum number of bends m- We present a new algorithmic 
pattern called algorithmic reduction, which provides conversion among different 
types of data structures. Reductions are often used to convert data structures 
into the form required by certain algorithms. GIOTTO is particularly suitable as 
a case study for the use of algorithmic reductions because it consists of many 
steps where the original graph is converted into various other structures (e.g., a 
flow network). 

Our implementation of GIOTTO is part of the GeomLib project PSl Geom- 
Lib is a reliable and open library of robust and efficient geometric algorithms 
written in Java. It is part of a larger project, named jdsl umiiii, aimed at con- 
structing a library of algorithms and data structures using the Java programming 
language. 

The rest of this paper is organized as follows. In Section 0 we present a 
brief description of the jdsl objects used in our implementation of GIOTTO, 
especially the concept of decorations. In Section 0 we describe how algorithms 
are implemented in jdsl and introduce the concept of algorithmic reductions. 
The object-oriented design of GIOTTO is described in Sectional In Section 0, we 
compare our implementation with two other implementations of GIOTTO. 

2 JDSL Structures 

Data Structures for Graphs jdsl contains three different graph structures 
represented by the interfaces Graph, OrderedGraph, and EmbeddedPlanarGraph. 
All graphs are “mixed graphs,” that is, they can contain both directed and 
undirected edges at the same time. The interfaces provide methods for dealing 
with the different types of edges both together and separately. 

The Graph interface models a graph as a combinatorial object, that is a 
container of vertices and edges. The Graph interface provides methods for adding 
and removing vertices and edges, as well as methods for examining the graph 
structure. 

The OrderedGraph interface inherits from the Graph interface and describes 
a graph as a topological structure by adding information about the ordering of 
the edges around each vertex. OrderedGraph provides additional methods that 
manipulate the topological information. 

The EmbeddedPlanarGraph interface describes an ordered graph whose or- 
dering of the edges around each vertex is induced by a planar embedding of 
the graph (hence it extends OrderedGraph). In addition to storing vertices and 
edges, EmbeddedPlanarGraph also stores the faces of the embedding. It provides 
additional methods that access various information about the faces of the em- 
bedding. 

Decorations Frequently, in implementing algorithms, it is convenient to asso- 
ciate extra information, either permanent or temporary, with the elements of 
the algorithm’s input. In graph algorithms, the elements that we want to aug- 
ment with extra information are vertices, edges, and faces of the input graph. 
This extra information, called decorations or attributes, can be used as tempo- 
rary scratch data (for example, to mark vertices of a graph as visited during 
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a traversal algorithm), or to represent the output (for example, to store geo- 
metric information in a graph drawing algorithm) . In writing the pseudocode of 
an algorithm, decorations are used implicitly (we often say “mark vertex v as 
visited”), and it is left to the programmer to determine exactly how they should 
be implemented (for sample implementations see, e.g ESI El). 

In the JDSL framework, each vertex, edge, and face contains a hash table for 
storing its decorations. Each decoration has a key (which is its key in the hash ta- 
ble) and a value, both represented by a Java Object. The interfaces Vertex, Edge, 
and Face extend the interface Decora ble, which provides methods for creating, 
deleting, and setting the values of decorations, as well as accessing all decora- 
tions of the object at once. The use of decorations is intuitive, since for each 
action that may potentially be performed on a decoration, the Decora ble inter- 
face provides a corresponding method. Thus, most manipulations of decorations 
require only a single method call. 

3 Algorithms and Reductions 

Algorithms as Objects In the JDSL framework, algorithms are modeled by 
objects instead of just procedures (an approach also discussed in (ZJ). This ap- 
proach presents the advantage of being able to store within the algorithm its 
input and output, as well as other information such as auxiliary variables and 
data structures used by the algorithm. In other words, this approach gives a 
state to the algorithm. An algorithm object is created with an instance of its in- 
put, provides a way to compute its output (frequently right in the constructor), 
and a variety of accessor methods that can be used to examine its state once the 
computation is finished. 

As an example, consider again an algorithm that performs a depth- first 
traversal starting at a given vertex. The main output of this algorithm is a 
depth-first tree corresponding to the traversal. Thus, the algorithm object will 
provide a method that can be used to examine the tree once it is computed. 
However, in the course of computing the tree, the algorithm uses internal mark- 
ing to indicate that a given vertex has been visited. Usually, this marking is 
only temporary (the data structure that stores it is local to the algorithm pro- 
cedure). If an algorithm is modeled as an object, the marking can be stored 
inside the object and will remain valid even after the computation is finished. 
The algorithm object will provide a method to access this internal data, which 
can be used, for example, to determine whether a given vertex belongs to the 
connected component of the start vertex. The algorithm’s state persists as long 
as the algorithm object does (until it is deleted or garbage collected), and can 
be examined any number of times without having to be recomputed. 

Modeling algorithms as objects and storing the results of their computation 
as state allows the same algorithm to compute several different results. In the 
course of computing a DFS tree, the depth-first traversal algorithm can also 
classify edges as discovery and back, compute the start and finish times for each 
vertex, etc. This information is computed as a side-effect of computing a DFS 
tree, but if it is stored inside the algorithm object, it allows the object to be used 
for purposes other than just traversal. In fact, the user who is only interested in 
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whether a given edge is discovery or back need not be aware of the fact that the 
algorithm object computes other information as well. The user can instantiate 
the algorithm object with the desired input, and use the accessor method that 
returns the type of each edge, ignoring any other accessors the object may have. 

Another advantage to modeling algorithms as objects is the ability to define 
relationships among different algorithms. An algorithm can extend another and 
specialize some of its parts, several algorithms can implement the same interface 
or extend the same base class, or an algorithm can use another as a subcompo- 
nent. 

Reductions In this section, we describe the pattern of algorithmic reduction. 
Often an algorithm can be implemented by making some alterations to its input, 
using another algorithm on the modified input, and then undoing the modifi- 
cations and interpreting the algorithm’s results to provide the answer to the 
original problem. When this is the case, all too often the transformations and 
the algorithms are clumped together, preventing reuse of any of the components. 
A reduction is an object responsible for the first and the last step of the computa- 
tion described above, that is for transformation of the input to fit an algorithm’s 
requirements and for undoing the alterations and interpreting results. 

A reduction, just as an algorithm, is modeled by an object. A reduction object 
is created with an instance of its input and proceeds in two directions, forward 
and reverse. In the forward direction the reduction alters its input in a specified 
way, often to make it fit the input requirements of some algorithm. Once an 
algorithm has been run on the modified input, the reduction undoes the changes 
to its input and possibly transforms the output of the algorithm to provide 
an answer to the original problem that used that reduction. Thus, a generic 
Reduction object has two methods forward() and reverse() and its functionality 
is to report an error when reverse() is called before forward() (since in that case 
the reduction does not make sense). The subclasses of this class define these 
methods to carry out the specific transformations. 

Reductions and algorithm objects are frequently combined together as com- 
ponents of other algorithms. By dividing the components of an algorithm’s im- 
plementation into reductions and algorithms, we clearly separate the transforma- 
tion and computation steps (which usually alternate). In this scheme, algorithms 
should never modify the structure of their input, all modifications are done by 
the reductions so that they can be undone later. When we look at an imple- 
mentation of an algorithm, we can easily identify which steps modify the input 
and which just perform computations, since they are modeled by fundamentally 
different objects. 

4 GIOTTO Implementation 

In this section, we show how using the jdsl components and algorithmic patterns 
described above, we can create a simple, modular, and extensible implementation 
of the core steps of the GIOTTO algorithm: orthogonalization and compaction. 

Our implementation consists of two main components each modeled by an 
algorithm object. The first component is the orthogonalization algorithm, which 
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constructs an orthogonal representation of a given embedded planar graph. In the 
current implementation, this algorithm accepts only embedded 4-planar graphs 
(embedded planar graphs whose vertices have degree at most four), however we 
plan to add a reduction from general embedded planar graphs to embedded 4- 
planar graphs, which would allow this algorithm to operate on any embedded 
planar graph. The second part of the implementation is the compaction algo- 
rithm, which accepts as input a 4-planar graph and its orthogonal representation 
and produces a planar orthogonal drawing where each vertex is represented by 
a point, and each edge by a chain of vertical and horizontal segments. Although 
these algorithm objects were developed as parts of the overall GIOTTO imple- 
mentation, their modular design allows them to be used independently, which 
means they can be reused as parts of other algorithms. 
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Fig. 1. Inheritance hierarchy of the reductions used by our implementation of 
the GIOTTO algorithm. Abstract classes are represented by shaded boxes. 



Orthogonalization Algorithm The orthogonalization algorithm m accepts 
as its input an embedded 4-planar graph with a fixed external face and produces 
an orthogonal representation of the graph with the minimal number of bends. We 
briefly review the algorithm following the description in ^ . For each (undirected) 
edge with endpoints u and v, call the two possible orientations (u,v) and (v,u) 
darts. The orthogonal representation of a graph is defined by assigning to each 
dart values a and (3 defined as follows: 

— a{u, v) ■ 'K is the angle at vertex u formed by the first segment of this dart 
and the next dart counterclockwise around u; 

— P{u, v) is the number of left turns of value tt/2 that are made when traversing 
the dart from origin to destination. 

The implementation of the algorithm is broken down into several objects that 
model the computational steps above. 

Orthogonalize This object models the orthogonalization algorithm itself. Its 
input is an instance of EmbeddedPlanarGraph and a designated external face 
which are provided to the object’s constructor (where the computation of 
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the orthogonal representation is performed). Orthogonal representation is 
modeled by associating with each edge an array of two Dart objects (with 
computed a and j3 values) using a decoration. The key to this decoration 
as well as all intermediate data generated by the computation (e.g. a flow 
network and the flow information) can be retrieved from the object using 
the accessor methods. 

FlowReduction (see Figure 0 This subclass of Reduction is responsible for con- 
verting an EmbeddedPlanarGraph into a flow network (modeled by an object 
of type Graph) and then interpreting the results of the minimum cost flow 
algorithm to compute an orthogonal representation. In the forward() method 
of the reduction, the flow network is constructed piecewise by each dart. The 
reverse() method, called once a flow algorithm has decorated each edge of 
the network with the amount of flow in it, computes the orthogonal repre- 
sentation by calling an appropriate method of each dart that interprets the 
flow and computes a and P values. 
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Fig. 2. Interaction diagram of the orthogonalization algorithm 



The interaction diagram of the orthogonalization algorithm is shown in Fig- 
ure 121 The computation, which is done in a separate protected method called 
from the constructor, proceeds by creating an instance of FlowReduction and 
calling its forward() method to build the flow network. The flow computation in 
the resulting network is done in a separate method. This provides the flexibility 
of allowing the user to redefine the minimum cost flow algorithm that is used. 
We provide a default implementation, but using another implementation is easy, 
all that is necessary is subclassing the Orthogonalize class and redefining the 
computeFlow method. All information necessary for the computation is passed 
to this method so that the programmer need not know the internal state of the 
algorithm object. This is an example of the template method pattern jOJEl that 
often comes up in the object-oriented design of algorithms. 

In order to make Orthogonalize fully functional, we provide an implementa- 
tion of computeFlow method using the cycle-annealing algorithm described in [IJ. 
The algorithm assumes that the edges of the network do not have a lower bound, 
so a reduction is used to transform the flow network into the form required by 
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the algorithm. The forward() method adjusts the production of vertices and ca- 
pacities of edges so that the lower bound can be removed. The reverse() method 
adjusts the computed flow to correspond to the correct flow in the original net- 
work. 

Once the lower bound is removed, we can use the cycle-annealing algorithm 
to compute the minimum cost flow. This algorithms uses several other objects 
as subcomponents. The initial flow in the network is computed by an object 
modeling the Ford and Fulkerson augmentation algorithm. The flow is stored 
as a decoration of the edges of the network. This flow algorithm operates on a 
single-source, single-sink flow network, so a reduction described in Section 0 is 
used to convert the network produced by the FlowReduction into the required 
form. The negative cost cycles are computed by a subclass of a Bellman-Ford 
algorithm object. 

Compaction Algorithm The compaction algorithm m takes as its input 
an embedded planar graph and its orthogonal representation and produces a 
drawing of the graph by associating a point with each vertex and a chain of 
vertical and horizontal segments with each edge. We briefly review the algorithm 
following the description in 

The algorithm first transforms the input graph into a graph whose faces all 
have rectangular shape by adding fictitious vertices at edge bends and decom- 
posing the non-rectangular faces into rectangles by means of fictitious edges. 
The algorithm then computes the length of the edges in the resulting graph. 
There are several ways to compute edge lengths, so the algorithm’s design again 
uses a template method to allow the user to specialize the computation. The de- 
fault implementation computes the lengths of the vertical and horizontal edges 
separately using an optimal weighted topological numbering algorithm. To com- 
pute the lengths of the vertical edges, the algorithm orients each vertical edge 
from top to bottom, condenses maximal runs of horizontal edges into vertices 
and computes an optimal weighted topological numbering of the resulting planar 
st-graph with respect to unit edge lengths. The length of a vertical edge is set 
to be the difference between the topological numbers of its two endpoints. The 
lengths of the horizontal edges are computed analogously. The final step of the 
algorithm generates a geometric object for each vertex and edge of the input 
graph. 

The compaction algorithm is modeled by the CompactOrthogonal algorithm 
object. The computation of the drawing can be broken up into three stages: the 
refinement of faces into rectangles, performed in the forward() method of the 
RectangularizeFacesReduction, the length computation, performed in the com- 
puteLengths method of CompactOrthogonal class, and the assignment of geomet- 
ric objects, performed in the reverse() method of the reduction. The interaction 
diagram of this algorithm is shown in Figured 

We will first describe the objects used in the computation of edge lengths. 

CompactEdgesReduction (see Figure [Q This abstract subclass of the generic 
Reduction takes as its input an EmbeddedPlanarGraph and its orthogonal 
representation in the form of Dart objects attached to the edges. The for- 
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ward() method of the reduction produces a planar st-graph by condensing 
some edges into vertices, and orienting other edges in a given direction. The 
discriminators for identifying which edges should be condensed and which 
oriented are provided by the subclasses of this class. The reverse() method 
anticipates that the vertices of the st-graph have been decorated with their 
topological numbering and uses the numbering to compute the lengths of 
the edges of the original graph. The lengths are stored as decorations of the 
edges. 

UnitWeightedTopNumbering This objects models an algorithm for computing 
optimal weighted topological numbering of a digraph. The algorithm stores 
the numbering of each vertex as a decoration and provides a method to 
access the decoration’s key. 

The computeLnegths method proceeds by creating an instance of CompactVer- 
ticalEdgesReduction and calling its forward() method to obtain a graph corre- 
sponding to the input graph with contracted runs of vertical edges. In the next 
step, an instance of UnitWeightedTopNumbering is constructed using the graph 
obtained from the previous step as its input, the execution of this algorithm 
decorates the vertices of the graph with their topological numbers. The reverse() 
method of the reduction is called next, it uses the just computed numbering dec- 
oration to compute lengths of the horizontal edges and store them using another 
decoration. Analogously, vertical edge lengths are computed using a CompactHo- 
rizEdgesReduction and another instance of UnitWeightedTopNumbering. 

We now turn our attention to the refinement procedure. In order to transform 
the faces of the graph so that they all have rectangular shape, it is necessary to 
add fictitious vertices at edge bends and split non-rect angular faces into rectan- 
gular components with fictitious edges. These operations present a book-keeping 
complication. In the JDSL implementation of graph structures, when an edge is 
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split with a vertex, two new edges are generated, while the original edge becomes 
invalid. A similar situation occurs when a face is split with an edge. We would 
like to implement a reduction that would modify the original graph by splitting 
some of its components, but once reversed give back the unaltered graph. In 
order to do this, however, we may have to keep track of the edges that were 
split and insert them back when the reduction is reversed. Such book-keeping 
is rather cumbersome to implement. Instead, we propose an alternative strat- 
egy. Whenever a reduction destructively modifies its input graph, it creates an 
exact copy of that graph first and makes the alterations on the copy. Mapping 
between the elements of the original graph and the elements of the copy can 
be done through decorations. Each vertex, edge, and face of the original graph 
is decorated with the corresponding element of the copy, and vice versa. When 
the reduction is reversed, instead of trying to patch up the modified graph, it 
uses the information computed for the copy (which usually takes form of decora- 
tions) to determine the correct output for the original graph. Once all necessary 
information is transfered from the copy to the original, the copy graph can be 
thrown away (deleted or garbage collected) . 

Operations that slightly alter the structure of a graph to make it satisfy a 
given criterion are quite common in graph drawing algorithms. In fact, the need 
to model such operations generically was the main motivation for developing 
the pattern of algorithmic reductions. We have developed a hierarchy of graph- 
modifying reductions (see Figure 0 that reflects the different situations where 
these reductions can be used. 

CopyingReduction This subclass of the Reduction class is the parent of all reduc- 
tions that act on instances of Graph interface and destructively modify their 
input. The class provides a method which creates an exact copy of the re- 
duction’s input graph. The reduction also provides keys to two decorations: 
mapping from the elements of the original graph to the elements of the copy 
and vice versa. 

GeometricReduction This reduction is the superclass of structure-altering reduc- 
tions used by graph drawing algorithms. The graph objects that it operates 
on are of type EmbeddedPlanarGraph. Subclasses of this reduction are usu- 
ally structured as follows: in the forward () method the input graph is copied 
and altered. The reverse() method computes the geometric shapes for the 
elements of the input graph based on the information computed by an al- 
gorithm acting on the copy of the original graph. Geometric information is 
stored as decorations of the elements of the input graph. 

RectangularizeFacesReduction, which performs the refinement procedure in 
the GompactOrthogonal algorithm is a subclass of GeometricReduction since in 
its forward step it makes alterations to the graph structure, and in the reverse 
step it computes geometric information for the graph. The reduction proceeds 
by making many elementary changes to the graph structure, such as splitting 
an edge or a face. Each of these changes can in turn be modeled as a separate 
object, with several of such objects combined to form the complete reduction. 
Thus, another subclass of GeometricReduction is GeometricReductionStep, which 
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models an alteration of one element (vertex, edge, or face) of a graph structure. 
Subclasses of this class operate on a copy of the original graph (since the changes 
they may make destructive changes), but they may either use a copy provided 
to their constructor, or create one of their own. Each GeometricReductionStep is 
created with a graph element that it will modify in its forward () method and pos- 
sibly decorate with its geometric representation in the reverse() method. GIOTTO 
implementation uses the following subclasses of the GeometricReductionStep 

RemoveBendsReduction This reduction models removal of bends from a single 
edge of a graph. In the forward() method of the reduction the correspond- 
ing copy edge it split by inserting a “dummy” vertex for each bend of the 
edge. The reverse() method of the reduction decorates the input edge with 
its geometric representation — a chain of vertical and horizontal segments. 
The reduction uses the information about the length of each edge, which 
was computed between the calls to forward() and reverse() to compute the 
geometric representation. 

SplitFaceReduction The input of this reduction is a face of the graph which 
is not rectangular. The forward () method of the reduction decomposes the 
corresponding face of the copy into its rectangular components by splitting 
it with several edges. The reduction determines the locations for the split- 
ting edges by traversing the face counterclockwise performing a split every 
time a right turn is encountered. Since a face does not have a geometric 
representation in the drawing reverse() method of this reduction is empty. 

ExpandVertexReduction This object will be used as a subcomponent of the re- 
duction object to convert a general embedded planar graph into 4-planar. 

In its forward() method RectangularizeFacesReduction creates an instance of 
RemoveBendsReduction for each edge that has bends, and an instance of SplitFac- 
eReduction for each non-rectangular face. The reduction steps are constructed 
with the copy of the original graph created by the RectangularizeFacesReduction, 
since there is no need to copy the graph for each little alteration. Executing 
forward 0 methods of all reduction steps refines all faces of the copy graph into 
rectangles. In the reverse() method, the reduction creates geometric representa- 
tions of the vertices of the graph (since no vertex refinement was done), undoes 
the step reductions, which creates geometric representations for the edges with 
bends, and finally creates geometric representations for the remaining edges. 
Since all alterations were made on a copy, there is not need to make any modi- 
fications to the input graph. 

5 Design Evaluation and Comparison 

Graph Hierarchy jdsl does not make a detailed classification of graphs and 
their drawings based on their specific properties. All types of graphs are modeled 
by the Graph interface if they contain only combinatorial information, and by 
EmbeddedPlanarGraph interface if they also contain the topological information. 
Thus, in the jdsl framework there are no special classes for a DAG, a biconnected 
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graph, a 4-planar graph, etc. A DAG, for example, is just a Graph which happens 
to have only directed edges and no directed cycles. 

A design which models each graph type which has some special proper- 
ties with a separate class is also possible. This approach is used in the design 
of the GDToolkit ^3 graph drawing package (the successor to Diagram 
Server |3|^). GDToolkit provides a hierarchy of graph classes, where addi- 
tion of special properties or structure is modeled through inheritance. For exam- 
ple, an orthogonal planar undirected graph adds to a planar undirected graph 
information about the number and angles of bends, and is, therefore, modeled 
as its subclass. 

In the JDSL framework, the conversion between different types of graphs 
does not require creating new graph objects, just modifying existing ones. For 
example, to create a directed graph from an undirected one, all that is required 
is to set the direction of the edges using, e.g., the setDirectionFrom(Edge, Vertex) 
method of the Graph interface. If a directed graph is modeled by a separate class, 
as it is done in GDToolkit, a new object has to be created, using the original 
graph as a parameter in the conversion constructor. The drawback of creating a 
new object is the fact that a given graph cannot be made directed, a conversion 
constructor will create a graph which is a directed copy of the original. This 
presents a difficulty since any objects referencing the vertices and edges of the 
original graph will have to be updated to reference the elements of the new one 
or a mapping between the new and old elements will have to be created. 

On the other hand, not having separate classes for the special types of graphs 
complicates error checking. In jdsl, since there is no type for a digraph, an 
algorithm that operates on digraphs takes just a Graph as its input. It then 
has to check at runtime that its input is correct, that is that the graph only 
has directed edges, and report an error if that is not the case. If a digraph is 
implemented as its own type, however, any algorithm that accepts an object of 
that type can assume that it is inherently correct, and thus does not have to do 
any error checking of its input. 

Decorations As described in Section 0 decorations are very useful in imple- 
menting various graph algorithms. There is a number of ways to implement dec- 
orations, but to provide a good implementation the following guidelines should 
be adhered to: (F) Decorations should be easy to use. A procedure for identifying 
and manipulating a decoration that belongs to a vertex, edge or face of a graph 
should be easy and intuitive. (ii.jThe number of decorations that can be added 
to a graph element (vertex, edge or face) should not be limited, otherwise that 
element will not be extensible. 

JDSL implements decorations by associating with each element of the graph 
a hash table for storing its decorations. This scheme satisfies both conditions, 
since any number of decorations can be stored in an element’s hash table, and 
accessing decorations requires just a method invocation. 

An alternative implementation is used in the leda library jl t)| . which pro- 
vides decorations for graphs and planar maps through node-, edge-, and face- 
arrays. A node-array stores the decorations for the vertices of a graph in a G-l — h 
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vector and uses the internal numbering of the vertices to access decorations of 
a given vertex. The edge and face arrays are implemented in similar fashion. 
This scheme also satisfies both conditions above since creating new decorations 
just requires making more vectors, and the mapping between elements and their 
decorations is done internally, so that the user can easily access the necessary 
decorations. 

JDSl’s implementation has one advantage over leda’s — the decorations 
and the decorated objects are tightly coupled, which makes it easier to imple- 
ment several actions. In the leda implementation, it is difficult to access all 
decorations of a given object, since they may be spread out through the entire 
implementation of the algorithm. In the JDSL implementation, the Decorable in- 
terface provides the method attributes, which returns an enumeration of all the 
decorations of that object. Coupling decorations and the decorated objects also 
provides more fine grained control over the access to the decorations. In leda 
implementation, any object that has access to, e.g., a node-array can modify a 
decoration of any vertex. In jdsl implementation in order to access an object’s 
decoration, one needs a reference to that object first. 

The JDSL system of decorations is not limited to just graph data structures. 
Nodes of sequences and trees can also be decorated, which simplifies implemen- 
tation of such data structures as red-black trees, where a node’s color can be 
stored in a decoration. 

GIOTTO Implementation There are currently several other implementations 
of GIOTTO, provided as parts of graph drawing packages (see, e.g., tanoiii) 
In this section we compare some of the features of two such implementations, 
provided in Graph Drawing Server and GDToolkit m, with our im- 
plementation described in Section 0J We will examine each implementation with 
respect to the following criteria. Modularity: How the algorithm is broken up 
into subcomponents. Extensibility: How easy it is to make modifications to the 
algorithm and extend its functionality. Ease of use: How clearly defined is the 
interface through which the algorithm should be used. Use of graph structures: 
Which graph structures the implementation uses and how it uses them. Alter- 
ations to the input: Several steps of GIOTTO make modifications to the input 
graph such as adding edges and vertices which need to be removed in the final 
output. We examine how each implementation adds the new elements, keeps 
track of them throughout the execution, and removes them. 

The Graph Drawing Server | 3 | is a web-based graph drawing and trans- 
lation facility. The user submits a graph in one of a number of formats, selects a 
drawing algorithm to be used and specifies an output format with a request to 
the server. The implementation of GIOTTO that is part of the server is not meant 
for use outside the server, and therefore has several application-specific features. 
Modularity: The component breakdown is limited to placing major algorithms 
into separate procedures. Extensibility: This particular implementation is hard 
to extend, since many of its components were developed specifically for use by 
the GIOTTO algorithm and were not meant to be reused for other purposes. 
Ease of use: Using the GIOTTO implementation requires calling several functions 
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corresponding to the steps of the algorithm in order. A unifying function for 
GIOTTO algorithm is not provided. Use of graph structures: This implementa- 
tion of GIOTTO uses a special data structure to keep track of the graph and other 
information. The data structure is highly specific to the implementation. Alter- 
ations to the input: Fictitious vertices and edges are added directly to the main 
data structure and labeled as such. In the stage when the algorithm computes its 
final output, the labels are examined and elements that are labeled as fictitious 
simply do not get included in the output. 

GDToolkit |I2I provides an implementation of several graph drawing algo- 
rithms. As a consequence, it has a supporting library of graph data structures, 
and also uses several leda components. Although the main purpose of the pack- 
age is for displaying graphs, it is developed in such a way as to make it possible 
to add new algorithms. Modularity: The library provides several graph struc- 
tures, each modeled by a class. The implementation of GIOTTO, however, is not 
separated from the graph class that it is applied to. Also, algorithmic subcom- 
ponents (such as the minimum cost flow computation) are not separated from 
the main GIOTTO implementation. Extensibility: The implementation of GIOTTO 
is a private function of the embedded planar graph structure, which means that 
extending its functionality, even through inheritance, is not possible. Therefore 
it is difficult to use the algorithm for anything other than computing a drawing 
and displaying it on the screen. Ease of use: Using the GIOTTO algorithm in 
GDToolkit is very easy. To create an orthogonal representation for a given 
graph instance, that instance needs to be assigned to an object of type orthog- 
onal planar graph. The assignment automatically performs the conversion and 
runs the orthogonalization algorithm. Use of graph structures: GDToolkit pro- 
vides several graph structures to be used by graph drawing algorithms. One of 
the classes is used to model a drawing as a graph which contains information 
about the coordinates of its elements and can be drawn in a window. GIOTTO 
proceeds by first constructing a drawing object with an unspecified layout, which 
is a copy of its input graph and then creating an orthogonal representation for 
the drawing. Alterations to the input: This implementation of GIOTTO uses a 
marking system similar to the one used in the Graph Drawing Server to in- 
dicate fictitious elements. The object that models a drawing provides a method 
to remove the fictitious elements that can be called before displaying a graph. 

The main function of both the Graph Drawing Server and of GDToolkit 
is to apply a variety of drawing algorithms to graphs provided by the user, and 
to display the resulting drawing. These packages are not intended to be used as 
libraries for programmers who want to develop new algorithm implementations. 
As a consequence, many of their components are application-specific and thus 
are difficult to reuse outside the package. 

The implementation of GIOTTO presented in this paper is intended to be used 
in a variety of applications. It is therefore aimed at providing components that 
are reusable and can be easily adapted to the required application. Modularity: 
Each object and reduction in the implementation of GIOTTO is modeled by a 
separate object with a well-defined interface. Algorithms are used as subcom- 
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ponents of other algorithms by instantiating them, computing the output, and 
using the desired accessor methods. Extensibility: GIOTTO and its subcompo- 
nents are designed in such a way as to be easily extended for use in various 
applications. All algorithms are modeled as objects, which means they can be 
extended through inheritance. The template method pattern is used when there 
are alternative algorithms for a given action, or when there may be user-defined 
actions taken at a given step. Ease of use: Since the GIOTTO algorithm is modeled 
by an object, using it just requires creating an instance of that object passing 
to it the desired input graph in the constructor. Use of graph structures: Our 
implementation of GIOTTO uses the Graph and EmbeddedPlanarGraph interfaces 
described in Section]^ Alterations to the input: Our implementation of GIOTTO 
does not make any changes to the input data structure, it only adds decorations. 
Any alterations that are done to the input are handled by the reduction objects 
which have the ability to undo any changes they make. 
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Abstract. We describe a unified framework of aesthetic criteria and 
complexity measures for drawing planar graphs with polylines and curves. 
This framework includes several visual properties of such drawings, in- 
cluding aspect ratio, vertex resolution, edge length, edge separation, and 
edge curvature, as well as complexity measures such as vertex and edge 
representational complexity and the area of the drawing. In addition to 
this general framework, we present algorithms that operate within this 
framework. Specifically, we describe an algorithm for drawing any n- 
vertex planar graph in an 0{n) x 0(n) grid using polylines that have at 
most two bends per edge and asymptotically-optimal worst-case angu- 
lar resolution. More significantly, we show how to adapt this algorithm 
to draw any n-vertex planar graph using cubic Bezier curves, with all 
vertices and control points placed within an 0(n) x 0(n) integer grid 
so that the curved edges achieve a curvilinear analogue of good angular 
resolution. All of our algorithms run in 0(n) time. 



1 Introduction 

One of the main contributions of research in graph drawing has been the formal 
identification of aesthetic criteria that graph drawings can possess together with 
the trade-offs that exist between these criteria and various complexity measures 
of drawings. Such formal specifications allow us to quantify the qualities that 
make a drawing “nice” to look at and studies of various trade-offs further our 
understanding of which aesthetic goals can be realistically achieved. Examples 
of such studies are too numerous to enumerate, but some well-known examples 
include the importance of the drawing area identified by de Fraysseix, Pach, 
and Pollack jSj, the illumination of the value of bend minimization in polyline 
drawings by Tamassia P31, the discussion of aspect ratio by Ghan et al. P, 
and the identification of angular resolution as a signficant aesthetic criterion by 
Formann et al. |S|, and its further study by Kant Hm, Garg and Tamassia 0, 
and Malitz and Papakostas CD. 

* Work by this author is supported in part by the U.S. Army Research Office under 
Grant DAAH04-96- 1-0013 and by NSF under Grant CGR-96-25289. 

** Work by this author is supported in part by ONR grant N00014-96- 1-0829. 
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This paper is directed at continuing this tradition by articulating a unified 
framework for describing aesthetic criteria and complexity measures for draw- 
ings of graphs that place vertices at points in the plane and represent edges with 
smooth curves (that is, curves with at least continuity) or polylines (that is, 
polygonal chains). We review or generalize several well-known graph-drawing de- 
sign goals for this framework, including aspect ratio, vertex resolution, and bend 
minimization, as well as introduce design goals that are not as well-known, such 
as a curvilinear analogues to angular resolution and edge resolution. In addition 
to articulating this framework for curve and polyline drawings, we also describe 
new planar graph drawing algorithms that operate within this framework. 

1.1 Related Prior Work 

Before we describe our framework and our algorithms for this framework, how- 
ever, let us review some related prior work. Specifically, since we reviewed above 
much of the prior work on the identification and study of general aesthetic cri- 
teria, let us review here some of the prior work relating to the drawing of planar 
graphs with curves and polylines. 

One of the now-classic results for drawing planar graphs is an algorithm by 
de Fraysseix, Pach, and Pollack 0 for drawing an n-vertex planar graph in an 
0(n) X 0(n) integer grid using straight line segments to represent edges. Orig- 
inally presented as an 0(n log n) time algorithm, Chrobak and Payne |21 show 
how to cleverly use a tree data structure to reduce the running time of this 
algorithm to 0(n). This algorithm does not achieve good angular resolution, 
however, because edges incident on the same vertex are not necessarily nicely 
“spread out.” But this algorithm introduces an important concept known as the 
canonical ordering of the vertices of a plane graph, which is vital to several other 
graph drawing algorithms, including several algorithms by Kant uni- In particu- 
lar, Kant uses this ordering in an algorithm to draw a planar graph with vertices 
placed in an 0(n) x 0(n) integer grid and edges drawn as polygonal chains with 
at most three bends per edge in order to obtain good angular resolution. For- 
shadowing the topic of our paper, Kant observes that polyline drawings can be 
modified to give drawings with nicely curved edges, by, say, using splines. 

Unfortunately, several issues related to curve drawings are not addressed by 
Kant. For example, if one applies standard methods for “smoothing” polylines, 
there may be no guarantee that edges that previously did not cross will not 
cross in their smoothed form. In addition, there are several complexity issues 
related to polyline smoothing that should be addressed, such as deciding if a 
polyline with three bends is better represented using a single degree-4 curve 
or as two degree-3 curves joined at a point (which is called a knot). Equally 
important is the impact that such a decision has on the beauty of the drawing. 
Indeed, questions such as this are the driving force behind our search for a unified 
framework for characterizing graph drawings that use polylines and/or curves 
to represent edges. 

On the topic of curve drawings themselves much less is known. There are 
several systems (including most general drawing packages) that allow for curves 
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and polyline smoothing, but we have not been able to find much written on 
formal studies of curve drawings. A notable exception to this is work of Gansner, 
Koutsofios, North, and Vo 0 that addresses the issue of curve drawings for 
general graphs, considering aesthetic criteria such as the hierarchical structure 
of the underlying graph, the edge length, symmetry, and the number of edge 
crossings. Their analysis is not as formal as, say, the work of Kant uni for polyline 
drawings, but Gansner et al. nevertheless provide evidence that their algorithm 
runs quickly and produces drawings that compare favorably with previous work. 
Unfortunately, the heuristics upon which their algorithm is based do not easily 
provide concrete statements about the properties of the drawings produced. For 
example, it seems that a drawing of a planar graph produced by this algorithm 
may have edge crossings. Thus, there is a need for algorithms for drawing graphs 
with curved edges so as to guarantee various aesthetic and complexity goals. 



1.2 Our Results 

In this paper we review and generalize several known aesthetic and complexity 
goals to a unified framework for drawing graphs with polyline and/or curve 
edges. Some of the novel aspects of this framework include a design goal we call 
edge separation, which is a curvilinear analogue to the concepts of edge resolution 
and angular resolution used for polyine drawings, as well as a concept we call 
curvature minimization, which provides a curvilinear analogue to the avoidance 
of sharp bends in polyline drawings. 

In addition to the presentation of this framework, we provide algorithms 
that operate within it and for which we can make concrete performance claims. 
Specifically, we give an algorithm to draw an n-vertex planar graph in 0(n) 
time in an 0(n) x 0(n) integer grid using polyline edges that have at most 
two bends per edge and achieve an asymptotically-optimal worst-case angular 
resolution. Thus, we are able to achieve similar goals to a polyline drawing of 
Kant m, but with only two bends per edge instead of three. More importantly, 
by limiting each polyline to two bends, we show how to extend our algorithm to 
draw an n-vertex planar graph in 0(n) time in an 0{n) x 0(n) grid so that each 
edge is drawn as a single cubic Bezier curve (i.e., with no knots) in a way that 
achieves good edge separation and has every vertex and control point placed at 
points with integer coordinates. Before we give the details of these algorithms, 
however, let us describe our unified framework for drawing graphs with polylines 
and curves. 



2 A Framework for Drawing with Curves and Polylines 

The goal of many graph drawing algorithms is to visually represent the essential 
properties of the graph in an intuitive and pleasing drawing. 



156 Michael T. Goodrich and Christopher G. Wagner 



2.1 Aesthetic Criteria 

Based on this primary objective, we consider several formal aesthetic criteria for 
drawing graphs with edges represented as polylines and/or curves. Most of these 
criteria are not new, but are instead either direct translations of known aes- 
thetic criteria to this combined curve/polyline framework or are generalizations 
of known polyline aesthetic criteria so that they also apply to curve drawings. 
In particular, we consider the following aesthetic criteria: 

Aspect Ratio: If we consider the smallest rectangle that encloses a drawing, 
the aspect ratio is the ratio of the longer side of the rectangle to the shorter. 
In most cases, aspect ratios closer to 1 are more desirable. 

Vertex Resolution: The minimum distance between any pair of vertices. Hav- 
ing good vertex resolution allows an observer to quickly distinguish vertices. 
Edge Length: Edges should be as short as possible, so as to allow the eye to 
easily identify connected vertices. 

Edge Separation: Edges should be separated so as to be easily distinguished. 
Our formalization of this notion is a combination of two commonly used 
graph drawing criteria — edge resolution and angular resolution — generalized 
to the curved setting. 

We define an offset region for each edge e, which defines a region around e 
that should not contain any other edges or vertices. For each point p on e, 
the offset from p is defined as all points at distance at most Se{p) from p 
along a line perpendicular to e at p, where Se{p) is a function that depends 
on our drawing goals. (See Figure 



For example, if we desire an angular resolution of at least 0 > 0 and edge 
resolution of at least s > 0, then we would define 



where u is the distance along e from the endpoint of e closer to p. 

Thus, if e is a straight line segment, then the edge separation simply states 
that every edge nonincident to e is at least distance s away and every edge 
incident to e defines an angle of size at least 9. In this typical usage, we refer 
to 9 as the angular parameter and s as the separation parameter. 

Bend/ Control- Point Minimization: For a polyline edge, we would like to 
have as few bends as possible, since many bends tend to make edges more 




Fig. 1. The offset region of an edge e. 



Se{p) = min{u,tan0, s} 
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difficult to follow. Analogously, as the bends in a polyline edge define the 
edge, so too control points define a spline or approximation curve. The more 
control points used to define a curve, the higher the degree of the curve or 
the larger the number of knots needed in a spline representation; hence, we 
wish to minimize the number of control points for curve edges. 

Curvature Minimization: We would like edges to follow a relatively direct 
route between vertices, without sharp turns. The curvature of an edge is a 
measure of how quickly the edge bends. For polyline edges, this is simply 
the angle created at a bend. For curves, the curvature is measured as the 
rate of change of the angle of the tangent vector to the curve at any point. 
This, of course, can be computed directly, but a common approximation of 
curvature is to use the norm of the second derivative of the curve’s defining 
equation. 

These are not the only aesthetic criteria that one might wish to formalize, 
but they form a core set of common goals among wide classes of drawings. 

2.2 Complexity Goals 

In addition to such aesthetic criteria, there are also complexity issues for graph 
drawings, such as the following, which should also be considered: 

Vertex Representational Complexity: Since graph drawing algorithms are 
typically implemented on computers, we should minimize the space needed 
to represent vertices. For example, many graph drawing algorithms impose 
the restriction that vertices be represented with integer coordinates, using 
O(logn) bits each. Such drawings are called grid drawings. Notice that this 
condition also forces vertex resolution to be at least unit distance. 

Edge Representational Complexity: Similarly, we should consider the space 
needed to represent edges in a drawing. For example, in grid polyline or curve 
drawings we require that bend points and control points be represented with 
integer coordinates, using O(logn) bits each. In addition, by achieving good 
bend/control-point minimization, we are also reducing edge representational 
complexity. 

Area: The area of a drawing is defined as the area of the smallest rectan- 
gle that encloses the drawing. We typically desire the area of a drawing to 
be small while not violating our aesthetic goals. Di Battista et al. P) and 
Tamassia P) summarize several known bounds on area-tradeoffs for vari- 
ous aesthetic properties for different types of drawings. For example, 0{n^) 
worst-case area is required for planar polyline grid drawings PQimiii2]. 

Another interesting example of a tradeoff between the aesthetic criteria and 
the complexity measures is the angular resolution of the drawing. Clearly, the 
angular resolution of any graph drawing is at most 2tt / d^ where d is the degree 
of the graph being drawn. But for a planar straight line grid drawing with O(n^) 
area, the angular resolution can be shown to be 0{-^) (e.g., see ^5)- However, 
allowing the edges to be polylines, Kant m shows how to achieve an angular 
resolution of 0(^) in linear time with at most three bends per edge. 
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2.3 Characterizing Our Algorithms in This Framework 

Operating within this unified framework, we give 0(n) time algorithms that 
build on the canonical-ordering approach of the algorithms of de Fraysseix, Pach, 
and Pollack [3 and Kant’s algorithm mu to achieve the following results for grid 
drawings: 

— each edge is a polygonal chain (cubic Bezier curve) with at most two bends 
(non-endpoint control points) per edge, such that each edge is monotonically 
increasing or decreasing in both x and y coordinates, 

— vertices and bend (control) points are located at points in an 0{n) x 0{n) 
integer grid (which implies an 0(1) aspect ratio). 

— the polyline drawing achieves an edge separation with separation parameter 
s = 1 and angular parameter 6 that is 0{l/d), where d is the degree of 
the graph; the curvilinear drawing algorithm achieves these bounds for its 
control curves. 

We begin the presentation of the algorithm by first reviewing the canonical 
ordering of the vertices of a planar graph urn- 



3 Our Polyline Drawing Algorithm 

Let G be an n-vertex plane graph of degree d. In addition, without loss of 
generality, we assume that G is maximal, since we can triangulate the faces of G 
without increasing the degree of any vertex in G by more than a constant factor. 



3.1 The Canonical Ordering 

The following canonical ordering of the vertices of G is due to de Fraysseix, 
Pach, and Pollack P| (and extendeded by Kant P33)- Let a, b, and c be the 
external vertices of G. A canonical ordering of the vertices of G (with respect to 
this embedding) is a labeling v\ = a^V 2 = h,vz, ■ ■ ■ ,Vn = c such that: 

— the graph Gk is biconnected, internally triconnected (the subgraph of Gk 
induced on the interior vertices of Gk is triconnected), and Gk contains the 
edge (ui,U 2 ); 

— vertex Vk+i is on Gk+i and has at least two neighbors in Gk on the path 
Ck — (ui,U 2 ), all consecutive, 

where Gk is the subgraph of G induced on vertices vi,V 2 , ■ ■ ■ ,Vk and Gk is the 
exterior face of Gk- We will refer to the vertices of Ck in order, meaning their 
clockwise order on Ck beginning with v\. Such an ordering can be constructed 
in 0{n) time pi ITnj. 
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3.2 Our Algorithmic Approach 

Our algorithm is a blending of de Fraysseix, Pach, and Pollack’s ideas 0 and 
the algorithm that achieves angular resolution with three bends per edge due to 
Kant HD). The novel aspect of our algorithm is in the development and use of a 
simple structure for each vertex, which we call the join box. 

Let V{G) be the vertex set of graph G, and let d{v) denote the degree of 
vertex v. For each v € V{G), take a square around v with corners d{v) — 1 units 
above, below, left, and right of v. This is the join box associated with v (see 
Figure 01 • This construction is quite simple, yet it is the key to being able to 
reduce the number of bends per edge to two. 



p v,d(v)-l = q V,d(v)-1 




Fig. 2. A vertex v’s join box. 



Notice that if v is located at a lattice point (integer grid point), then there 
are 4d(u) — 4 lattice points located on v’s join box, called join points. We label 
the join points as indicated in Figure 0 

The algorithm adds one vertex to the drawing of G during each phase so that 
the join points of the new vertex can connect to the join points of its neighbors 
via a straight line segment without any edge crossings. The join points will 
become the bends in the polyline edges of the graph, so that edge (u,v) will 
have two bends, one at a join point of u and the other at a join point of v. 

The algorithm proceeds iteratively. At phase k of the algorithm, the 
subgraph of G, Gfc, has been embedded in the plane without edge crossings, 
while maintaining the following invariants: 

1. The vertices and hence the join points of the subgraph Gk are located 
at lattice points. 

2. Let wi = vi,W 2 ,W 3 , . . . , Wm = V 2 be the vertices of the exterior face Gfc of 
Gfc in order and let x{wi) be the x-coordinate of vertex Wi. Then x(wi) < 
x{w2) < . . . < x{Wm). 

3. Let [wi,Wi+i] be an edge connecting join points of Wi and Wi+i. Then 

either has slope +1 or —1, Vi, 1 < i < m. 
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4. For each v G V{Gk), the join points and have been used and all 
of v’s edges to its neighbors in Gk have been drawn. Also, the upper join 
points that have been used are consecutive beginning at and (as in 
Figure E|) • 

5. Each edge is monotonically increasing (decreasing) in both the x and y di- 
rections. 

Notice that the contour of the exterior face Gk should appear as in Figure El 




Fig. 3. Appearance of Gk at any stage of the algorithm. 



Now, consider when the algorithm is in phase fc -|- 1, i.e., ready to construct 
Gk+i from Gk- First, note that as in the work of de Fraysseix, Pach, and Pol- 
lack 0, invariant (2) along with the fact that the sides of the join boxes have 
slopes of -|-1 and —1 imply that any two join points on the exterior face of Gk 
have an even Manhattan distanc^. This property will allow us to initially place 
Vk+i- If we let p{a,b) be the intersection of a line with slope -1-1 through point 
a and a line with slope —1 through point b, where x{a) < x(b) then for any two 
join points, a and b, p{a, b) is a lattice point. Also, notice that invariants (2) and 
(3) imply that the line segments from a to p(a, b) and from b to p(o, b) do not 
intersect Gk- So, we can add vertex Vk+i by finding its leftmost and rightmost 
neighbors on Cfc, choosing suitable join points, a and b, of these neighbors, and 
initially placing the bottom of v’s join box at p(a, 6). Unfortunately, this initial 
placement of Vk+i does not satisfy invariant (4). This issue is resolved by care- 
fully shifting the vertices of the drawing of the subgraph Gk to create space 
for Vk+i, and then inserting Vfc+i along with edges to its neighbors on the outer 
face Gk, while maintaining all four invariants. Invariant (5) will be particularly 
useful when we replace the polyline edges by Bezier curves. 

Let wi = vi,W 2 ,W 3 , - - - , Wm = V 2 be the vertices of the outer face Ck of Gk 
in the order of their appearance from left to right, and let wi and Wr be the 
leftmost and rightmost neighbors, respectively, of Vfc+i on Ck- (Note that the 
neighbors of Vfc+i in Gk form a contiguous sequence of vertices rv/, u>/+i, - - - ,Wr 
on the exterior face Ck of Gk-) Also, let qi^i be the next unused join point on 
the right side of wi's join box and let Prj be the next unused join point on the 

^ The Manhattan distance is the standard li distance. In this case, d(w, v) = 
\x{u) - x(v)| -h \y(u) - y{v)\- 
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left side of Wr’s join box. We will initially place the bottom of Vk+i’s join box 
be the second bend point in edge (wo,Vk+i)- The remainder of the join points 
on the southwest side of Ufe+i’s join box will go to w;+i, . . . , Wo-i and the join 
points on the southeast side of Vk+i’s join box will go to Wo+i, . . . ,Wr-i- This 
will guarantee that edges (wi,Vk+i), ■ • ■ , (wo, Ufc+i) are all monotonically increas- 
ing and edges (u>o+i, Ufe+i), . . . , {wr, Vk+i) are all monotonically decreasing. (We 
need only make sure that Ufc+i’s join points are assigned to its neighbors in such 
a way that there are no edge crossings. That is, if Wi and Wi+i are assigned join 
points a and b respectively, then a;(a) < x(b).) Also, notice that by shifting, since 
we will only increase the x coordinate of any vertex v, and thus any vertices to 
the right of u, we do not destroy the monotonicity of the edges. 

Thus, the key to the algorithm is in determining the sets of vertices to be 
shifted so that edge crossings cannot result from this shifting. 

In the full version of this paper we show how to define shifting sets in 0 {n) 
time so that shifting of vertices will not introduce any edge crossings. In fact, 
this argument allows us to achieve the desired edge separation. To see this, we 
give a brief inductive argument. 

In the base case, this condition holds, since, for example, ui’s join box will 
never intersect the convex hull of edge (^2,^3) because vertices are always shifted 
to the right. We can see that this condition still holds at the inductive step by 
noting that it holds for the darkly shaded region of Figure El by applying the 
inductive hypothesis to Gk and by noticing that the only new edges added are 
incident on Vk+i so that they all are above the darkly shaded region. Hence, 
Ufe+i’s join box does not intersect any nonincident edges, and no other join 
boxes can intersect the convex hulls of the edges incident on Vk+i- Thus, the 
convex hulls of nonincident edges do not intersect, implying the nonincident 
edges themselves do not intersect, since we will be drawing with Bezier curves. 




Fig. 4. The graph after shifting. 



In fact, because the vertices and bend points are located at integer coordi- 
nates and because the convex hull of each edge does not intersect any nonincident 
join box, the convex hulls of the edges have a separation of at least unit distance, 
implying that the edge separation of nonincident edges is at least unit distance. 

It remains to analyze the algorithm. 
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3.3 Properties of the Embedding 

First, it is clear that we have a planar polyline drawing of G, with at most two 
bends per edge. We now examine the angular resolution of the drawing of G and 
the size of the grid on which G is drawn. 

Lemma 1. The size of the minimum angle is where d is the maximum degree 
of any vertex of G. 

Proof. Let v G V (G) have degree d. Then, the minimum angle, 0, occurs between 
a horizontal join segment, and a neighboring join segment. For d < 5, the lemma 
is easily verified. So, assume d> 6. The size of the angle is 0 = arctan(^j^). By 
the MacLaurin series expansion of arctan, we know that for |a;| < 1, arctan(a;) = 
x—^x^+^x^ — jx'^ + . . . > x—^x^. Since < 1, we have 0 > — > 

> 2 , completing the proof. 

Lemma 2. The size of the grid is at most (20n — 48) x (lOn — 24). 

Proof. Since G is drawn entirely within the convex hull of vertices {vi,V 2 ,vs}, 
to find the length of the grid, we need the distance between and S 2 ,i, and to 
find the height, we need only find S 2 ,i). We know that vi never moves, so 

the length is simply the distance that V 2 is shifted to the right, which is bounded 
by ~ 4) = 4(6n — 12 — n) = 20n — 48. But this implies that the 

height of the grid is lOn — 24, and the proof is complete. 

3.4 Implementation Details and Rnnning Time 

Chrobak and Payne 0 give a linear time implementation of the algorithm of de 
Fraysseix, Pach, and Pollack 0 (which originally was published as an 0(n log n) 
time algorithm) that is easily extended to our algorithm. 

The details are omitted in this extended abstract. 

With this linear implementation, we arrive at the following theorem: 

Theorem 1. There is a 0(n) time algorithm to draw a planar graph on a grid 
with the following properties: 

— each edge is a polygonal chain with at most 2 bends per edge, 

— each edge is monotonically increasing or decreasing in both its x and y co- 
ordinates, 

— vertices and bend points are located at integer grid points, 

— the size of the minimum angle created by two edges is at least 

— the size of the grid is 0{n) x 0{n), 

— there is at least unit distance between nonincident edges, 

— and the drawing achieves edge separation with angular parameter 0=2 
separation parameter s = 1. 

Figure El is a graph on 10 vertices drawn using our algorithm. 
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Fig. 5. A drawing of a graph on 10 vertices. 



4 Drawing with Curves 

We will now use some properties of the polyline grid drawing to show that the 
edges can be replaced by curves while still maintaining the desired aesthetic 
criteria. As mentioned, the type of approximating curve used in our algorithm 
is a cubic Bezier curve. 

A cubic Bezier curve is defined by four control points Vbj hi, V2, and V3. The 
curve interpolates Vq and V3, with the two segments {Vo,V\) and (^2,^3) giving 
the tangents of the curve at the respective endpoints. The curve C is defined 
parametrically as C = Vo(l — uY + 3 Vim (1 — uY + 3V2u'^(l — u) + V^u^, where 
u € [0, 1]. A particularly useful property of Bezier curves is that the curve stays 
within the convex hull of the four control points. 

We will argue that if we replace each polyline edge by a Bezier curve, where 
the control points of the curve are the vertices and bend points of the polyline 
edge, we do not introduce any edge crossings. In fact, we also are able to maintain 
edge separation (angular and edge resolution), as defined in the framework. 

We deal with two cases, the case of nonincident edges and the case of incident 
edges. We have already argued that a join box does not intersect the convex hull 
of a nonincident edge, implying that we have unit separation of nonincident 
edges. 

We now address the case of incident edges. Since the tangents of each curve 
simply correspond to the first segments of the polyline edges, we have angu- 
lar resolution of ^ in the planar curve grid drawing. Before arguing that in- 
cident edges do not cross, recall that all edges are monotonically increasing 
or decreasing. Consider edge e. Without loss of generality, the control points 
of e are (0, 0), (xi, j/i), (0:2, 2/2), and {xsjyY, with O<X1<X2<X3 and 
0 < yi < y2 < ys- Consider another edge incident at (0,0), call it /. We want 
to examine the separation between e and /. When measuring edge separation, 
we want to consider the minimum distance between any pair of incident edges. 
Clearly, this distance would be achieved by parallel edges, if they were allowed 




164 Michael T. Goodrich and Christopher G. Wagner 



in the graph. So, we will consider the case of parallel edges and show that we 
have a measure of edge resolution for parallel edges, implying that we have that 
same edge separation for incident edges. 

So, to make e and / as close as possible, let the control points of / be 
(0,0), (xi + l,yi- 1), {x 2 + l,?/2 - 1), and (x 3 ,y 3 ), as in Figure0 




Fig. 6. Parallel edges e and /. 



The X and y coordinates of edge e are given by Xg = 3m(1 — u)^xi + 3u^(l — 
u)x 2 + v?X 3 and ye = 3u(l — u^yi + 3u^(l — u)y 2 + u^t/3, respectively. Similarly, 
the X and y coordinates of edge / are given by xj = 3m( 1 — + 1) + 3u^(l — 

u){x 2 + 1) + u^X 3 and yj = 3u(l — uY{y\ — 1) + 3u^(l — u)(?/2 — 1) + ^^2/3, 
respectively. Taking differences, we have Xf — Xg = 3u(l — u) and yf — yg = 
— 3u(l — u). This means that for any u* G [0, 1], we have a point e{u*) on e, and 
the corresponding point on /, f{u*), is 3 m(1 — u) units below and to the right of 
e{u*). For convenience, this is how we opt to initially measure the edge resolution 
of incident edges; ie, the distance between edges along a line of slope ±1. So, the 
minimum distance between incident edges is at most 3u(l — •u)j\/2. (We deal 
with monotonically increasing edges here, but the argument for decreasing edges 
is identical.) 

This fact, coupled with the monotonicity of the polyline edges implies that 
converting the polyline edges to cubic Bezier curves does not introduce edge 
crossings. To see this, suppose that e and / cross. Then there is a point on / 
that is above some point on e. Call these points /* and e*, respectively. Then 
cc(/*) = x{e*) and y{f*) > y{e*). Let u be the parameter value that gives this 
point f*. Then, e{u) is above and to the right of /*. But if this is the case, then 
e must come back down from e{u) to pass through the point e*, contradicting 
the monotonicity of the edges. 

See Figure Q for an example of a curve drawing. 

This gives the following theorem: 

Theorem 2. There is a 0{n) time algorithm to draw a planar graph on a grid 
with the following properties: 

— each edge is a cubic Bezier curve, 

— each edge is monotonically increasing or decreasing in both its x and y co- 
ordinates, 



A Framework for Drawing Planar Graphs with Curves and Polylines 165 




Fig. 7. The graph from Figure 0 drawn with curve edges. 



— vertices and control points are located at integer grid points, 

— the size of the minimum angle created by two incident edges is at least as 
measured by the tangents of the curves, 

— the size of the grid is 0{n) x 0{n), 

— there is at least unit distance between nonincident edges, 

— and the drawing has angular parameter 0=2 and separation parameter 
s = 1 for the control curves. 

5 Comments and Open Problems 

Our algorithm can easily be extended to triconnected graphs, just as Kant uni 
extends the algorithm of de Fraysseix, Pach, and Pollack. When we initially 
place a chain of vertices, we simply locate the initial point based on the vertex 
of maximum degree in the chain. Then, we shift by the sum of the degrees of 
the vertices in the chain. 

Having established a framework for drawing graphs with curves, there are 
any number of aesthetic criteria that can be redefined for curve drawings, leading 
to some directions of further research: 

— Devise new algorithms to draw graphs with curves, operating under this 
general framework. 

— Devise a dynamic algorithm for drawing graphs with curve edges. 

— If one of the control points of the curve edges is far from the remaining three, 
the curvature of the edge can be quite large, in fact proportional to n. One 
may be able to devise an algorithm that also minimizes the curvature of the 
edges. 

— Because there are roughly 4d(u) join points on v’s join box and we only use 
d{v) of them, we waste a great deal of space. There may be a way of cleverly 
reducing the sizes of the join boxes and thus the constants involved in the 
area of the drawing. 
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— Since we use the upper join points in order, often the join points near the top 
of the join box are not used. There may be some heuristics that allow for a 
better “spacing” of the upper join points once the final layout is constructed. 
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Abstract. We present a linear time algorithm that constructs a planar 
polyline grid drawing of any plane graph with n vertices and maximum 
degree d on a (2n — 5) x (|n — grid with at most 5n — 15 bends 
and minimum angle > |. In the constructed drawings, every edge has at 
most three bends and length 0(n). To our best knowledge, this algorithm 
achieves the best simultaneous bounds concerning the grid size, angular 
resolution, and number of bends for planar grid drawings of high-degree 
planar graphs. Besides the nice theoretical features, the practical draw- 
ings are aesthetically very pleasing. An implementation of our algorithm 
is available with the AGD-Library (Algorithms for Graph Drawing) [HQ. 
Our algorithm is based on ideas by Kant for polyline grid drawings for 
triconnected plane graphs I2'6l . In particular, our algorithm significantly 
improves upon his bounds on the angular resolution and the grid size for 
non-triconnected plane graphs. In this case, Kant could show an angular 
resolution of oAy and a grid size of (2n — 5) x (3n — 6), only. 



1 Introduction 

In automatic graph drawing, the task is to visualize discrete structures so that 
they are easy to read and to understand. Applications include drawing of entity- 
relationship diagrams, PERT-diagrams, and flow-diagrams (see, e.g., pTTl 1^ ). 
For a survey on graph drawing, see e.g., cnnia 

Important aesthetic criteria for nice drawings are: a small number of edge 
crossings, small area, few bends, and a good angular resolution (see, e.g., |23). 
Unfortunately, the problem of optimizing the above criteria is NP-hard, even if 
we restrict ourselves to subsets of the criteria (see, e.g., jT^I I2HH71 E!7I IT71 mij ) . 
E.g., it is NP-complete to decide whether a graph can be embedded in a grid 
of prescribed area |ES] even if we restrict ouselves to planar orthogonal grid 
drawings of trees [Z]. Furthermore, it is NP-hard to compute a drawing with the 
minimal number of crossings u^. However, it is possible to test whether a graph 
can be drawn without crossings in linear time ED- In this paper, we restrict our 
attention to planar graphs. 

* Partially supported by DFG-Grant Mu 1129/3-1, Forschungsschwerpunkt “Efliziente 
Algorithmen fiir diskrete Probleme und ihre Anwendungen” . 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 167-^23 1998- 
(c) Springer- Verlag Berlin Heidelberg 1998 
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It is well-known that planar n- vertex graphs can be drawn without any bends 
on a grid of size (n — 2)x(n — 2) [01E3IHI. However, the quality of these planar 
straight-line drawings is not sufficient in practice. One reason for this is that the 
angles of the drawings get too small. Indeed, they achieve size 12 (;^). 

The term angular resolution denotes the size of the minimum angle formed 
by any two edge segments sharing a common point in a drawing. Unfortunately, 
one can show that a good angular resolution in planar straight-line drawings 
can be achieved only at the cost of the area taken up by the drawing. Garg 
and Tamassia HS| have shown that a class of planar graphs exists that require 
exponential area in any planar straight-line drawing with optimal 12(1) angular 
resolution. 

Drawings with good angular resolution and small grid size are the so-called 
orthogonal (grid) drawings. Here, only horizontal and vertical line segments are 
used for representing the edges. Hence, only fc-multiples of 90°-degree angles for 
k S {1,2, 3, 4} occur. While minimizing the number of bends in an orthogonal 
drawing of a 4-planar graph G is NP-hard ini, the problem can be solved in 
polynomial time when the embedding of G is fixed. Tamassia has presented an 
algorithm that constructs a planar orthogonal grid drawing of an embedded 
planar graph with maximum degree four with the minimum number of bends 
m- It can be shown that the size of the drawing is bounded by (n-l- 1) x (n-|- 1) 
P] and the number of bends by 2n -|- 4 if the graph is biconnected. However, 
in practice, the algorithm by Tamassia produces drawings of better quality on 
average m- Its running time is 0(n-i ^/logn ) [ 121 - 

Good bounds can also be achieved by the linear time algorithms suggested 
in P 03 03|. More precisely, for biconnected plane graphs, the constructed 
drawings need a n x n grid and have at most 2n -|- 4 bends 0303. For non- 
biconnected plane graphs, the bounds on the grid size and the number of bends 
are 1.2n x 1.2n and 2.4n-|-2, respectively 0. This can be improved to nxn and 
2n, respectively, if change is permitted to the embedding of the graph Ol- 

Unfortunately, all these algorithms work for planar graphs with maximum 
degree four only. For planar graphs with higher degree, several extensions have 
been suggested: 

(1) drawing the nodes as boxes pmiH). 

(2) introducing two grids: a coarse grid for the nodes and a finer grid for the 
line segments (Kandinsky model) jHj, and 

(3) allowing locally non-orthogonal line segments |E1| 03- 

The disadvantage of approach (1) in is that the sizes of the nodes may grow 
very large, independently of their degree. Biedl ^ suggested a proportional- 
growth model, in which an increase in the sizes of the boxes is allowed only 
according to the degree of the corresponding node. She also presents a linear 
time algorithm that constructs a planar orthogonal drawing in the proportional- 
growth model for any triconnected plane graph. In these drawings, the grid size 
is bounded by (2n — 5) x (I.5n — 3) and the number of bends by 2n — 6 Pj. 

If the user prefers equally sized nodes, approaches (2) and (3) are the method 
of choice. However, the resolution in these drawings is not always sufficient. 
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While in drawings of approach (2), line segments occur very close to each other, 
in the drawings of approach (3), the angles may get too small. Biedl ^ has 
given a linear time algorithm that constructs a planar orthogonal drawing in 
the Kandinsky model of any planar graph in a grid of size (3n — 7) x (1.5n — 3) 
with at most 3n — 8 bends. It can be shown that any drawing constructed in 
the Kandinsky model can be transformed into the proportional-growth model 
without introducing additional bends M- 

Kant m has suggested a linear time algorithm that constructs a planar 
polyline grid drawing on a (2n — 5) x (3n — 6) grid with at most 5n — 15 bends 
for a triconnected plane (simple) graph G = (V, E). The important point here is 
that the minimum angle is at least where d denotes the maximum degree of a 
node in V. For connected planar graphs, the size of the smallest angle is bounded 
by using the result that every plane graph G can be augmented by adding 
edges to a triconnected plane graph G' so that d' < |"|d] -1-3, with d and d' the 
maximum degree of G and G', respectively. In these drawings, every edge has at 
most three bends and length 0(n). Kant called his algorithm the “mixed model” 
algorithm, since he used a framework for straight-line planar drawings based on 
the shelling order (also called canonical ordering) for triconnected plane graphs 
for constructing the polyline drawings. Instead of placing single vertices as done 
in the straight-line algorithms, so-called boxes of vertices are placed. A box of 
a vertex v consists of the vertex itself and the segments representing the first 
parts of incident edges to v. 

Our new algorithm is based on Kant’s ideas. Here, we use a shelling order for 
biconnected plane graphs introduced in I2D1 for achieving straight-line drawings 
of biconnected graphs. Based on this ordering, our definition of the bounding 
boxes of the vertices is similar to that of Kant. Then we use a placement al- 
gorithm in order to place the bounding boxes as in the straight-line drawing 
algorithms [2D|- This gives a linear time and space algorithm that constructs a 
planar polyline grid drawing of a connected (simple) planar graph with n ver- 
tices and maximum degree d on a {2n — 5) x (|n — ^) grid with at most 5n — 15 
bends and minimum angle > ^, in which every edge has at most three bends. 

Note that our algorithm leads to a big improvement concerning the minimum 
angle and the grid size compared to Kant’s work. To our best knowledge, this 
algorithm achieves the best simultaneous bounds concerning the grid size, an- 
gular resolution, and number of bends for planar drawings of high-degree planar 
graphs. Besides the nice theoretical features, the practical drawings are aesthet- 
ically very pleasing. An implementation of our algorithm is available with the 
AGD-Library (Algorithms for Graph Drawing) [21 QJ- 

The paper is organized as follows: Section O contains mathematical prelim- 
inaries. The algorithm is presented in Sect. 0 First, we introduce the shelling 
order for biconnected plane graphs from then we define the bounding boxes 
and finally describe the placement algorithm. In Sect. 0 we analyze the algo- 
rithm: its correctness, the linear running time, the bounds on the grid size, the 
size of the minimum angle, and the bounds on the number of bends. Here, we 
also show that the bounds on the grid size are tight. Some practical aspects are 
discussed in Sect. o In particular, we try to locally save edge bends and some 
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grid lines. Although, in practice, most of the drawings improve significantly, the 
theoretical bounds do not improve. In the final section, we give a summary and 
discuss some open problems. 

2 Mathematical Preliminaries 

Let G = {V, E) be an undirected graph. We let (u, w) denote an edge connecting 
vertices v and w, i.e., {v,w) and {w,v) denote the same edge. The vertices v 
and w are the two endpoints of the edge {v,w). We call G simple if G contains 
neither multiple edges nor self loops. Throughout this paper, we only consider 
simple graphs. A walk in G is a sequence vgeivi . . . CkVk of alternating vertices 
and edges, such that the endpoints of are Vi-i and Ui for 1 < z < k. We usually 
write vq, ... ,Vk for short. A walk is called circular if uq = Ufe. A path is a walk 
in which each vertex appears at most once. A path is a chain in G if each vertex 
on the path has degree two in G. 

A planar drawing of a graph G is a drawing of G in the plane without edge 
crossings. A graph is called planar if it allows for a planar drawing. It is called 
plane if it is associated with a planar drawing. An embedding of a planar graph G 
is the collection of circular orderings of the incident edges (or adjacent vertices) 
around each vertex in a planar drawing of G. For a vertex v of G, we will refer 
to the counter-clockwise circular ordering around v. A plane graph divides the 
plane into regions called faces. The clockwise boundary of a face / is a circular 
walk voeivi . . . CkVk along the edges bounding / such that Vi+\ is the successor of 
Vi-i in the circular counter-clockwise ordering around vi. The counter-clockwise 
boundary is defined analogously, where Vi+i is the predecessor of Vi-i in the 
ordering around Vi. The unbounded face is called the exterior face; all other 
faces are called interior faces. Edges and vertices belonging to the exterior face 
are called exterior edges and exterior vertices, respectively; all other edges and 
vertices are called interior edges and interior vertices. It is well-known that a 
planar graph with n vertices contains at most 3n — 6 edges. 

Let W, W C E be two disjoint sets of vertices of G = {V, E). We denote by 
E{W,W) the set of edges with one endpoint in W and the other endpoint in 
W' , and by E{W) the set of edges with both endpoints in W . The subgraph of 
G induced by the vertices in IF C E consists of the vertices in W and all edges 
in E{W). A connected graph G is called k-connected if it is still connected after 
deleting any set of fc — 1 vertices. For k = 2 and k = 3, we also say biconnected 
and triconnected , respectively. 

3 The Algorithm 

In this section we describe our new drawing algorithm. Suppose the input graph 
G = (F, E) is plane and simple. The idea of the algorithm is to assign inpoints 
and outpoints to each edge of G. We draw every edge e = (u, w) as polyline that 
goes from v to the outpoint Uout of e. Then, from Uout vertically to a point, say 
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u. From u horizontally to the inpoint Uin of e, and finally from Uin to w. Thus, 
each edge gets at most three bends. 

Algorithm Mixed-Model works in three phases. In the first phase, we compute 
an ordered partition tt = Vi, . . . , Vat of the vertices in G, that is, l^U. . .UVat = R 
and Vi n Vj = ^ for i ^ j. The rank of a vertex v is the index i of the set Vi 
containing v. In the second phase, we determine the positions of the in- and 
outpoints. For each vertex v G V, we define the inpoints of all edges (v,w) 
with rank(w) < rank(v), and the outpoints of all edges (y,w) with rank(w) > 
rank{v) relative to v. In the third phase, we finally compute the coordinates 
of the vertices. We build the drawing incrementally by adding the vertices in 
Vi, . . . , VAf in the order given by tt. The ordering tt guarantees that we always 
add vertices lying in the exterior face of the current drawing. Once we have the 
vertex coordinates, the polylines for the edges are given by the in- and outpoints. 
In the following, we discuss each step in detail. 

3.1 Computation of the Ordered Partition 

Here, we explain how to compute the ordered partition tt = V\, , Vat- Let Gk 
be the plane subgraph of G induced by Fl U . . . U 14 . We want tt to have the 
following properties: 

(PI) For each set Vk = {zi , . . . , Zp\ there exist two vertices left{Vk) and right{Vk). 
Let Ai(Vfc) = {{z^,Zi+i) I 1 < j < p} for A: > 1, and E 2 {Vk) = {{left{Vk), zi), 
{zp, right (Vk))} for k>2. Then, the set 

S= U E,{Vk)\EU U E2{Vk)\E 

l<k<N 2<k<N 

can be inserted into the plane graph G without introducing any crossings. 
We denote with G = {V, E) the plane graph with E = E\J S. 

(P2) Vi = {ui, . . . , Us} is a path on the clockwise boundary of the exterior face 
of G, and the vertices in Vfe, A: > 2, lie on the exterior face of Gk. 

(P3) We define Gi to be the sequence of vertices ui, . . . ,Us. Consider & k >2 
and let Gk-i = ci,...,Cq be already defined. Let eg = left{Vk) and Cr = 
right {Vk). Then, we define Ck to be the sequence ci, . . . , c^, Vfe, c^, . . . , Cg. 
(P4) Let^Vfc = {zi, . . . , Zp}. 

(a) E{Vk) = {{zi, Zi+i) I 1 < f < pj 

(b) Let k >2, Gk-i = ci, . . . , Cq, left{Vk) = ce, and right{Vk) = Cr. Then, all 
neighbors of Vk in Gk-i lie in {cg, . . . ,Cr}. If p > 2 then Vk has exactly 
two neighbors in Gk-i- 

These properties facilitate an incremental approach of building the drawing 
by inserting step-by-step the vertices in Vk into the exterior face of Gk-l^ and 
shifting the vertices right of and including right {Vk) by some grid units to the 
right if necessary. Kant defined a suitable ordering for triconnected plane graphs 
in Figure □(a) shows that it is not satisfying to augment the given plane 
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(a) (b) 



Fig. 1. The same graph drawn with augmentation (a) and with the new shelling 
order (b) 



graph to a triconnected plane graph, and to then use the shelling order given by 
Kant. The graph depicted has been optimally augmented to a triconnected plane 
graph by adding the dashed edges. When these edges are removed, the resulting 
drawing looks strange. The reason is that vertex v has been placed before all of 
its neighbors. In 123! we have introduced a shelling order for biconnected plane 
graphs that guarantees that each set 14, fc > 2, has at least one neighbor in 
Gk-i- Figure n^b) shows the same graph drawn using the new ordering. 

Definition 1. Let G = (V,E) he a biconnected plane graph with at least three 
vertices. Let tt = (14, ... , I4v) be an ordered partition of V, that is, I4U. . .Ul4v = 
V and Vi nVj = ^ for i ^ j. Denote by Gk the plane subgraph of G induced by 
14 U . . . U Vfc ■ We say that tt is a shelling order of G if the following conditions 
are satisfied: 

(1.1) V\ = {vi, . . . ,Vs\i where ui, . . . , Us is a path on the clockwise boundary of 
the exterior face of G with s > 2 and E{{v \, . . . , zJs}) = {(ui, t’i+i) | 1 < i < 
s}. 

(1.2) Each Gk is connected and internally biconnected, that is, removing one 
interior vertex of Gk does not disconnect it. 

(1.3) Denote by C(. the walk on the counter-clockwise boundary of the exterior 

face of Gk from vi to Vg ■ For each 2 < k < N one of the following conditions 
holds, where = [i;! = ci, . . . , = ?;s]: 

(a) 14 = {z} and z is a vertex on C(, with at least three neighbors in Gk-i- 

(b) 14 = {zi, ■ ■ ■ , Zp} C C(., p > 1, and there are vertices ci,Cr, i < r, 
on C(, such that eg, zi, . . . , Zi, u \, . . . , Uj, Zi+i, ■ . . ,Zp,Cr is a path on the 
clockwise boundary of a face fk of G for some 0 < i < p, j > 0, ui, . . . ,Uj 
are vertices in G \ Gfc and E{Vk,Vi U . . . U 14_i) C {{eg, zi), (zp, Cr)}. 

We will use this shelling order in our algorithm. Since our input graph G is not 
necessarily biconnected, we first augment it to a biconnected plane graph G' = 
(V, E'). This can be done optimally using the algorithm in j2ti] . However, if we are 
allowed to change the embedding, the algorithms by Fialko and Mutzel JEl IZDj 
are the better choice. Then, we compute the shelling order tt = 14, ... , 14v for 
G' . Consider a set 14 = {zi, . . . , Zp} with k >2. We need to determine left{Vk) 
and rightiVk) (see PI). Let Gk-i = c\,...,Cq, and ei,...,6a be the edges in 
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G' with = {v^, Ci^) and G T4, such that < • • • < Iq. If a < 2, then Vk 
satisfies case 03)(b) and we set left{Vk) '■= ce and right {Vk) '■= Cr, where and 
Cr are the vertices that exist according to (P3)(b). 

If a > 3, we set left{Vk) := C£ and rightiVk) '■= Cr and determine I and r as 
follows. Let TV = {a^ \ G E}. If = 0 we set ^ := i\ and r := ia- If IV = {ct} 
we set 



t iit^ia 1 
ii otherwise] 



and r := 



t ii t ^ i \ 
ia otherwise J 



If |A^| > 2 we set £ := min{^ | G N} and £ := max{^ | G N}. 

Note that we only needed the graph G' in order to compute a feasible ordered 
partition of G. In the following, we consider the given plane graph G together 
with the ordered partition tt. 



3.2 Assignment of In- and Outpoints 

Recall that we assign an inpoint and an outpoint to each edge e = {v,w). In 
order to compute the absolute coordinates of an in- or outpoint of e efficiently, 
we only determine coordinates which are relative to either v or w. Therefore, for 
each vertex v there is a set of inpoints and a set of outpoints whose coordinates 
are relative to v. These in- and outpoints form a so-called bounding box around 
V, which we define in the following. 

We call the edges in {{v,w) \ rank{w) < rank{v)} the incoming edges of u, 
and the edges in {(u, w) \ rank{w) > rank{v)} the outgoing edges of v. We denote 
with in{v) the number of incoming edges of v, and with out{v) the number of 
outgoing edges of v. For a vertex v, we define the coordinates of the inpoints of 
all incoming edges of u, and the outpoints of all outgoing edges of v relative to 
the position of v. We point out that an edge {v,w) with rank{v) = rank{w) is 
an incoming edge of v and w, and thus is assigned two inpoints and no outpoint. 
But in this case, the edge is drawn as one horizontal line segment and we will 
not refer to its in- or outpoint in the algorithm. 

Let Vk = {zi, . . . ,Zp}, V = Zi, and set zq := left{Vk) and Zp+i := right{Vk). 
We define outi(v), outr{v), 6i and Sr according to the following table. 



OUtl{v) Si OUtr{v) Sr 


if 


out~{v) 1 out~^{v) 1 
out'^{v) 0 out~{v) 1 
out~{v) 1 out~^{v) 0 
out~{v) 0 out~^{v) 0 


in{v) > 2 

in{v) = 1 and {zi-\,Zi) ^ E 
in{v) = 1 and {zi,Zi+i) ^ E 
in{v) = 0 



where out~^{v) = 



out(v)-i 



we place the outpoints on the following points: 



and out (v) = max ^0, 'j ^ jf out{v) > 1, 



— outi{v) outpoints on the points {—outi{v), Si), . . . , {—1, outi{v) + Si — 1), 
which lie on a straight line with slope -1-1. 

— One outpoint on point (0, max{outi{v) + Si~ 1, outr{v) -I- (5^ — 1}). 
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out{v)=6, ;n(v)=l 





Fig. 2. Examples of bounding boxes 



— outr{v) outpoints on the points (1, outr{v) + i5r — 1), • . • , {outr{v), Sr), which 
lie on a straight line with slope —1. 

We set int{v) := max ^0, and irir{v) := max ^0, ^ if 

in{v) < 3, we set all inpoints on (0,0). Otherwise, we place the inpoints on 
the following points: 



— One inpoint on point (— m^(u),0). 

— ini{v) inpoints on the points {—int,{v), —1), . . . , (—1, —irii{v)), which lie on 
a straight line with slope —1. 

— One inpoint on point (0, —irir{v)). 

— irir{v) inpoints on the points (1, —irir{v)), . . . , {irir{v), —1), which lie on a 
straight line with slope +1. 

— One inpoint on point (irir(v),0). 



Figure El shows several examples of bounding boxes. The dashed lines indicate 
how vertical or horizontal segments are attached to in- and outpoints. We denote 
with outpoint (e) or outpoint{v, w) the outpoint belonging to edge e = (u, w). For 
an outpoint u, we denote with u.dx and u.dy the relative x- and y-coordinate of 
u, respectively. 



3.3 Computation of Coordinates 

Next, we show how to compute the coordinates of the vertices. Within the 
algorithm, we maintain absolute y-coordinates and relative x-coordinates. Let 
C = Cl, . . . , Cg be the current contour as introduced in Sect. Id. 11 i.e, C = Ck 
after the fcth step. For the x-coordinates 

( Xabsiv) if U = Cl 

Xabs{ci) - Xabs{ci-i) if u = c^ with i>2 (1) 

Xabs{v) - Xabs{father{v)) ii v ^ C 

holds where Xabs denotes the absolute x-coordinate. If we have to shift a vertex Ci 
on C to the right, we also have to shift all vertices Ci+i, . . . ,Cq and some vertices 
not on C. We denote with M{ci) the set of vertices that must be shifted if Ci 
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is shifted. We represent all sets M(ci), 1 < i < g, as a forest F of all vertices, 
where the roots are exactly the vertices on C. Thus, M{ci) contains the vertices 
in the tree with root c^. We denote with father{v) the unique predecessor of v 
in F, and guarantee that rank (father (v)) > rank(v) holds. We can retrieve the 
absolute x-coordinates using (P) by first traversing the vertices on C from left 
to right, and then traversing all other vertices by decreasing rank. 

Suppose we are in step k of the algorithm, that is, we have already placed 
Vi, . . . , Vfc_i, and we want to place 14 . We consider a vertex v that is already 
placed, i.e., v S Gk-i- An outgoing edge (v,w) of v is called free if w G G \ Gk, 
otherwise it is called closed. Let ei,...,em be the outgoing edges of v sorted 
by increasing x-coordinate of their outpoints. For an edge e^, 1 < /j, < m, let 
Wfj. be the endpoint yf u of e^, and denote with the rank of Note, that 
the definition of tt implies that there are indices 0<a<b<m+l, such that 
the edges ei, . . . , Ca are the closed edges with right(Vrif) = v, and e;,, . . . , 
are the closed edges with leftiVr^) = v. We define nextJeft’^(v), such that 
nextJeft^(v) < outpoint (e f^) .dx for a < p < m, and nextjright^(v), such that 
outpoint (e^).dx < nexGright^ (v) for \ < p <b\ 



Algorithm Mixed-Model-Placement 

Input: A plane graph G = (V,E), an ordered partition tt = Vi, . . . ,Vn of G 

satisfying (P1)-(P4), and in- and outpoints as defined in Sect. \^.'Z\ 
Output: Absolute y- and relative a;-coordinates according to equation (Q for 
each vertex in G. 

Initialization: We place the set 14 = {ui, . . . ,Vs}. We set 
y(vi) := 0 for 1 < i < s 
, N f outt(vi) if i = 1 

X\Vi) ■ outr(Vi-i) + OUtl(vi) -|-lif2<i<s 

C :=vi,...,Vs 

for k := 2 to N do 

Let G = Cl, Cq, Vk = {zi, , Zp}, ci = left(Vk) and Cr = right(Vk). 

For the y-coordinate, we simply set 

y(zi) := inr(zi) + max y(ci) -I- 1 for 1 < i < p 

l<i<r 

For the x-coordinate, we temporarily compute the x-coordinates of 
c^+i , . . . ,Cr relative to C£. Therefore, we set 




outpoint (ca) -dx if 1 < a < m 
0 if m = 0 



outpoint (cb) .dx if 1 < 6 < to 
0 if TO = 0 




outr(v) -|- 1 if TO > 0 and 6 = to -|- 1 



x(ci) := for £ -I- 1 < i < r. 

j=e+i 
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We set dxi such that outpoint (e) .dx < dxi for all free edges e of ci, and dxr 
such that dxr < outpoint (e) .dx for all free edges e of c^: 
dxi := nextjright^{ci) and dxr := nextJeft’^{cr) 



if in{z\) > 3 then 

We place a single vertex (see Fig. 0, i.e., p = 1. Let a^,. . . , be 

the neighbors of z\ on C. We set t := iini(zi)+ 2 - Let ut = outpoint{ct, z\) 
and dxt = Ut-dx. We want to place z\ directly above Ut- Therefore, 

x(zi) := max {x{ct) + dxt, dxi + outi{z\)} 

Let 5 = x(zi) — (x{ct) + dxt) ■ If <5 > 0, then z\ would lie to the right of Ut- 
We correct this by shifting ct, ■ ■ ■ ,Cq by S units to the right. For shifting 
Cr, ■ ■ ■ ,Cq, it is sufficient to set 

x{cr) := max{a;(cr) + <5 — x{zi), outr{z\) — dxr} 



The vertices c^+i, . . . , Cr-i disappear from C in this step, so we set their 
x-coordinates relative to z\ and update the forest F : 



else 



, . J x{ci) — x{z\) for £ < j t 

x{ci) + (5 — x(zi) for t < i < r 
father (ci) := zi for £ < i < r 



We place p > 1 vertices with at most two neighbors on C (see Fig. 0: 

, . J outi{zi) + dxi for i = 1 

' ’ [ outr(zi-i) + outi{zi) + 1 for 2 < i < p 

r 

x{Cr) := max < OUtr{Zp) — dXr,x(Cr) — X) 

1 i=l 



The vertices c^+i, . . . , Cr-i disappear from C in this step, so we set their 
x-coordinates relative to zi and update the forest F : 
x(ci) := x(ci) - x(zi) 
father {ci) := z\ for £ < i < r 

fi 

C . — Ci,...,C.^,Zi,.. ., Zp, Cr 1 • • • ^ Cq 



4 The Analysis 

In this section we show the correctness of our algorithm and analyze its running 
time and the bounds on the grid size, angular resolution, and number of bends. 
Let us analyze the running time of the algorithm. In the case in which the graph 
G is not biconnected, we can use the simple linear time augmentation algorithm 
suggested first by [23 in order to biconnect the graph. Note that this algorithm 
does not change the embedding of the plane graph G. The computation of the 
bounding boxes can obviously be done in linear time. From the outline of the 
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I i«,(Zi)=l 

max >'(Cj)=3 
/ i r 

Fig. 3. Placing a single vertex 



dx,=0 dx=-2 




Fig. 4. Placing vertices zi,.. . ,Zp 




placement algorithm it is easy to see that this part also runs in linear time. The 
correctness requires a proof which is omitted in this extended abstract. 



Lemma 1. The algorithm Mixed-Model correctly computes a planar polyline 
grid drawing for any given plane graph G in linear time and space. 

Let us now analyze the bounds achieved by the algorithm Mixed-Model. For 
reasons of limited space, we cannot give the proofs of all bounds here. 



Theorem 1. The algorithm Mixed-Model constructs a planar polyline grid draw- 
ing of any plane graph with n vertices and maximum degree d on a (2n — 5) x 
(|n — |) grid with at most 5n — 15 bends and minimum angle > |, m which 
every edge has at most three bends and length 0(n). 



Proof. The proofs of the angular resolution and the number of bends are similar 
to the ones given in | 23 |. We will give the proof for the bound on the height. We 
assume that the edges in the set S (see (P3)) have been inserted into our graph. 
Obviously, this assumption does not influence the height of the drawing. Note 
that we then have 



in. 



■ {wi) = max ( 0 



in{v) — 3 



< -in(v) — 1. 

- 2 V ; 



For each node v with rank(v) > 2 there exists a vertex w with rank(w) < rank(v) 
and y{v) = inr{v)-\-y{w)-\-l. Let wt be a node with maximum y-coordinate. Then 
there exists a sequence of nodes w\, . . . ,w± with w\ G Vi and y(wi+i) — y(wi) = 

inr(wt+i) + 1 . 

Let m be the number of edges in G. The height of the layout is 

t-i t 

y(wt) = ^(mr(wj+i) + 1) = (t - 1) + ^ inr{wi) 

i^2 ^ ^ 

1 3 7 

= < -{m-1) < -n- -. □ 
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Fig. 5. A drawing of Gio with tight 
height 



Fig. 6. A drawing of Gio with tight 
width 



The bounds for the grid size are tight. We present a graph class for which 
the bounds for the width and height, respectively, are tight. Let G„ = (W, A„) 
be the graph defined as Vn = {s,t,Wi, . . . ,Wn-2} and En = {{s,Wi), {t,Wi) \ 
i = 1, 2, . . . , n — 2} U {{wi, Wi+i) | i = 1, . . . , n — 3} U {(s, t)}. We consider an 
ordered partition with Vi = {wi,t} and Vn = {s}. This fixes the embedding 
of the graph G„. The algorithm Mixed-Model constructs the drawing shown in 
Figure 0 of height at least |n — |. The exact bound of |n — | is achieved by a 
simple triangle. Now, consider the ordered partition with Vi = {s,t}. According 
to the algorithm, the set Vi is placed with width 2n — 5 (see Fig. 1^. 

Note that Kant’s algorithm also needs width 2n — 5 for this example (and 
not 2n — 6 as claimed in Eni)- The bounds on the height cannot be achieved by 
the algorithm in EH]’ 



5 The Algorithm in Practice 

In this section we give some ideas how to modify and extent our algorithm in 
order to improve the grid size and the number of bends in practical instances. 
However, the worst-case bounds shown in Sect, ^remain unchanged. 

First, we consider vertices with degree one. We extend Mixed-Model in order 
to draw an edge incident to a vertex with degree one as a single short line 
segment. In a preprocessing step, we remove vertices with degree one from G. 
Then, we compute the ordered partition tt of the resulting graph G' . Let v be 
a vertex in G\G' and (v,w) the only edge incident to v. When we assign in- 
and outpoints, we reserve either an in- or an outpoint for (v, w) in the bounding 
box around w, on which we place v at the end of the placement step. Therefore, 
this in- or outpoint must not have relative coordinates (0,0). The placement 
algorithm needs some modifications in order to cope with the additional in- and 
outpoints. 

After placing a chain 14 = {zi, ■ ■ ■ , Zp} there may be A unused columns to 
the right of Zp (see Fig. Q- We can reuse these columns in later steps if we 
need to shift any of the vertices z\, . . . , Zp to the right. Hence, we can eventually 
reduce the width of the final drawing. The placing of a vertex v with in(v) > 3 
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Fig. 8. Postprocessing 



can be improved by computing the y-coordinate more carefully. Since not all 
inpoints of v have relative y-coordinate iuriy), we can eventually decrease the 
y-coordinate of v. Moreover, a postprocessing step can reduce the number of 
bends by local changes of the drawing. Figure 0 shows two examples, where we 
can move vertices on bend points. 

Figure 0 shows several examples produced by Mixed-Model using some of 
the modifications described above. The drawings remain readable even if they 
contain vertices of high degree. 








Fig. 9. Example drawings 
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6 Conclusions 

We have presented an algorithm that constructs a planar polyline grid drawing 
of any plane graph with n vertices and maximum degree c? on a (2n — 5) x 
(|n — |) grid with at most 5n — 15 bends and minimum angle > |. To the 
best of our knowledge, this is the algorithm achieving the best simultaneous 
bounds concerning the grid size, angular resolution, and number of bends for 
planar grid drawings of high-degree planar graphs. An implementation of the 
algorithm is available with the AGD-Library (Algorithms for Graph Drawing) 

^ . Practical experience has shown that the drawings produced by our algorithm 
are aesthetically pleasing. 

However, in general, the given graphs are not planar. In practice, the given 
nonplanar graph G is transformed into a planar graph G' , in which each crossing 
of two edges is substituted by an artificial vertex and four new edges. A possible 
planarization method is to solve the maximum planar subgraph problem and 
then to re-insert the deleted edges so that the number of crossings is minimized 
pnrr^ . Practical experiments have shown that this method produces drawings 
with a relatively small number of edge crossings HH. 

When using the algorithm Mixed-Model for such a planarized graph, the 
drawings may look strange and it is hard to identify the crossings. It is neces- 
sary to modify the algorithm so that the artificial crossing vertices are handled 
differently. It would be desirable to get mixed model drawings in which the edges 
are not allowed to bend in their crossing vertices. 
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Abstract. For a given set of n rectangles place on a plane, we consider a 
problem of finding the minimum area layout of the rectangles that avoids 
intersections of the rectangles and preserves the orthogonal order. Misue 
et al. proposed an 0(n^ )-time heuristic algorithm for the problem. We 
first show that the corresponding decision problem for this problem is 
NP-complete. We also present an 0(n^)-time henristic algorithm for the 
problem that finds a layont with smaller area than Misue’s. 



1 Introduction 

Several algorithms for automatic graph drawing have been proposed Pj2]. Most 
of the algorithms are designed to create layouts (i.e., drawings) of graphs from 
scratch. However many systems (e.g., interactive systems) need to adjust layouts 
after some modifications are made in graphs, and it is desirable to adjust layouts 
with preserving some geometric properties of the layouts. Thus, it is important 
to design layout adjustment algorithms appropriate to the systems. 

Geometric relations among vertices are very important geometric properties 
that should be preserved in adjustment of the layout. By preserving the geomet- 
ric relations in the layout adjustment, we can easily recognize the correspondence 
between vertices in the previous layout and those in the new layout. Eades et 
al.|2| proposed the following geometric relations. 

— orthogonal order: top-and-bottom and right-and-left relations between any 
two vertices; 

— proximity relation: a geometric proximity relation (e.g., the nearest relation 
between vertices); 

— topology: adjacent relations between regions of the layout. 

In this paper, we consider the orthogonal order as a geometric relation that 
should be preserved in layout adjustment. 

In some systems, vertices of a graph are sometimes represented by geometric 
figures such as rectangles or circles. Some modifications made on the graph, such 
as vertex insertion or vertex extension, may cause intersections of vertices. To 
avoid the intersections, layout adjustment is needed. Considering the display area 
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of the systems, it is important to find the intersection-free layout with minimum 
area. 

In this paper, we consider graphs where each vertex is represented by a 
rectangle and investigate the layout adjustment problem for minimizing the area 
under the following constraints. 

— The vertices (i.e., rectangles) should not intersect; 

— The orthogonal order of the vertices should be preserved. 

Misue et al. 0 proposed a heuristic algorithm for the problem. The main 
contribution of this paper is as follows. 

1. We prove that a corresponding decision problem of the layout adjustment 

problem is NP-complete. 

2. We propose a new heuristic algorithm for the layout adjustment problem. 

Our algorithm is superior to Misue’s; it finds a layout with smaller area than 

Misue’s while its time complexity O(n^) is the same as Misue’s where n is 

the number of vertices. 

This paper is organized as follows. In Section|2|and 13 we introduce some pre- 
liminaries and define the layout adjustment problem. We show the NP-complete 
result in Section 0 and present our heuristic algorithm in Section El In Section El 
we conclude this paper. 

2 Definition 

Let i? be a set of n rectangles vi,V 2 , ■ ■ ■ ,Vn- Each rectangle Vi has horizontal 
width Wi and vertical height hi, where Wi and hi are integers. We sometimes 
denote Vi by {wi, hi). A layout of i? is a function from R to coordinates on the 
plane. We denote a layout of i? by tt/? : i? — *■ for integral coordinate system, 

and ttr : R ^ for real coordinate system, where 2Z^ is an integral two 
dimensional space, and IR^ is a real one. 

Let Xi and yi be a;-coordinate and y-coordinate of a rectangle G i? in tt/j, 
respectively. That is, 'Knivi) = {xi, yi). This indicates that the coordinates of the 
center of Vi is (xi,yi) in tt/j. We assume that every rectangle is placed so that 
the boundary with length Wi is parallel to a;-axis, and do not allow rotation of 
rectangles. 

Let left^{vi) and rights (vi) be the ^-coordinates of the left and right bound- 
aries of Vi G R, respectively. The y-coordinates top^{vi) and bottom^(vi) are 
defined similarly. Formally, we define them as follows. 

left.,^{vi) = Xi- Wi/2, right^{vi) = Xi + w^/2, 
top^{vi) = yi- hi/2, bottoniTrivi) = yi + h^/2 

We also define similar notations for the layout 'kr as follows. 

WK'^r) = min right{-KR) = max right^{vi), 

Vi£R ViGR 

top^TTfi) = min top^{vi), bottom{TTR) = max bottomT^{vi) 

Vi£R ViGR 
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Let WxiT^R) and Wy{TTii) denote the horizontal width and the vertical width 
of ttri respectively. That is, 

Wx{t^r) = right{-KR) - left^iTR), Wy^iiR) = bottom^TTR) - top{TTR). 

We also use a notation (Wc(7T/{), (tt/j)) for ttr. We define an area S{ttr) of 

TTr as S{tTr) = Wx{TTR)Wy{TTR). 



3 A Layout Adjustment Problem 



We consider a layout adjustment problem for minimizing the area under the 
constraints that intersections of rectangles should be avoided and the orthogonal 
order of rectangles should be preserved. First, we define the problem as a decision 
problem, as follows. 



INSTANCE: A rectangle set R, its layout ttr, and a positive integer 
K, where 7TR(vi) yf iTR^Vj) for any two rectangles Vi,Vj G R{i yf j). 
QUESTION: Is there a layout tt^ with < K satisfying the 

following constraints (1) and (2)? 



Let {xi,yi) and (x',?/') be nR{vi) and respectively. 

(1) 7rjj preserves the orthogonal order of ttr. That is, for any two rectangles 
v^,Vj G R, 

Xi < Xj a:' < x'j , and Xi = xj a:' = a;' , and 

y^ < Vj ^ y'i< y'j, and = yj y ' = y'. 



(2) Any two rectangles do not intersect with each other in 7rjj. That is, for any 
two rectangles Vi, Vj € R{i yf j), 






or |y' -y'l > 



We denote the above problem by LADR and especially by ILADR in the case 
of integral coordinate system. 



4 The NP-Completeness of LADR 

We show that ILADR is NP-complete. It is easy to see that ILADR is in NP. 
Therefore, it is sufficient to show NP-hardness of ILADR. We reduce a well- 
known NP-complete problem 3-SATjO| into ILADR. 

Let X = {a;i,a; 2 , • • • ,Xr} be a set of boolean variables. We call Xi and Xi 
literals, and disjunction of literals clause. 3-SAT is defined as follows: 

INSTANCE: A set X of boolean variables and a boolean expression 
E = Fi A F 2 A - ■■ A Fm, where E is & conjunction of a finite number m of 
clauses, and each clause Fj = y^^ V y ^_2 V y^^s consists of three different 
literals over X . 

QUESTION: Is there a truth assignment for X that satisfies El 
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VTefrijVRfrj) VRix^) 




Fig. 1. Outline of the initial layout 7r^( e) of a rectangle set R{E) . 



4.1 The Transformation of 3-SAT into ILADR 

We transform 3-SAT with a boolean expression E into ILADR with a rectangle 
set R*{E) and its initial layout tte»[e) from E. First, we construct a partial set 
R{E) of R*{E) and its initial layout tte{e)- Other part of R*{E) is shown only 
in the proof. We set the coordinate of the upper-left corner of tte^e) to (0,0). 
The rectangle set R{E) includes a rectangle set VR{xk) for each variable Xk, a 
rectangle set CR{Fi) for each clause Fi, and a rectangle set R'{see Fig. Q. 

The rectangle set R' plays a role to restrict the positions of VR{xk) and 
CR{Fi). R' includes R” for each Fi{i = 1, • • • ,m). The initial layout of R" and 
R' is shown in Fig. El 

Figure0)a) illustrates an initial layout oiVR{xk) for a variable Xk- VR{xk) in- 
cludes rectangles vck,i for Fi(i = 1, • • • , m). VR{xk) is placed so that top{-Kyji{xk)) 
= top{nn') holds (see Fig. [Q. It has only two layouts shown in Fig.El if the area 
oiVR{xk) is restricted to (8, 2 -|- A{k — 1) -I- 2 -|- 2(4r -|- 3)m -|- 4(r — k)). We con- 
sider that Fig. Ela) (resp. Fig. EI)b)) corresponds to assigning true (resp. false) 
to Xk- Note that the two layouts differ in y-coordinate of vck^i- 

The rectangle set CR{Fi) includes a rectangle set LR{yij) for each literal 
yij{j = 1,2,3). There are two kinds of LR(jjij) and their initial layouts are 
shown in Figs.01)a) andEJa). FigureElshows LR{yij) in the case where yij = Xk 
for some Xk, and Fig. 0 shows the case where j/ij = aJF for some Xk- Every 
LR{yi,j) includes a rectangle (2,2) denoted by vsij. Let ds be the difference of 
y-coordinates between the upper boundary of LR{yi^) and the center of vsij. 
The layouts of LR{yij) are restricted only to the layouts in Figs. Eland O if the 
height is 8, c?s = 5 or c?s = 3, and the width is the minimum. We place vsij 
so to have the same y-coordinate as vck,i in VR{xk) if yi,j = Xk or Wk- The 
case of ds = 5 corresponds to Xk = true and the case of ds = 3 corresponds to 
Xk = false. Consider the case of where the height of LR{yij) is 8. If yij = true, 
that is, yij = Xk and Xk = true, or yij = xTk and Xk = false, the width can be 
10. On the other hand, if y^j = false, the width must be 12 or more. 

We now show the rectangle set CR{Fi) for the clause Fi (see Fig. ED. CR{Fi) 
includes LR{yij) and LR'{yij) (j = 1, 2, 3), a rectangle vk = {36{m + i — 2) , 4) , 
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Fig. 2. The layout of i?" and R' . 
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(a) Xk = true (b) Xk = false 

Fig. 3. Two layouts of VR{xk)- 
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(b) Xk = false 



vsij a 
(a) Xk = true 



(b) Xk = false 



Fig. 4. Two layouts of LR{yij), where Fig. 5. Two layouts of LR{yij), where 
IJi^j — ^k- 



and a rectangle Wi = (36(2m — i — 1) + 12,4). LR'{yij) consists of vCij = (4,4) 
and vfij = (6, 4). We place each LR(jjij) so that ^)) =bottom{'K (^y. ^)) 

+ A{k — 1) holds if yij = Xk or We place LR'{yij) and LR{yij) so that 

and right{'KLn(y. .)) = right^{vfij) hold. 

The initial layout of R{E) is shown in Fig. 0 We place CR{Fi) and R' 
so that top^{vli) = top{nii") holds (see Fig. 0). In the initial layout, for each 
rectangle in CR{Fi) except for vstj, there exists a rectangle in i?" with the same 
^-coordinate. When yij = Xk or ayf, the y-coordinates of vsij is the same as 
y-coordinates of vck,i in VR{xk)- Therefore, they have the same y-coordinate 
in any adjusted layout satisfying the constraint (1). CR{Fi) and CR{Fij^i) are 
apart enough for the rectangles in them not to intersect (see Fig.0. 

Each oiVR{xk) and CR{Fi) includes the polynomial number and size of rect- 
angles on r and m. R{F) includes polynomial number of VR{xk) and CR{Fi). 
Therefore the initial layout of R{F) can be constructed in polynomial time. 
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Lie'(y,i ) ) “'(3'i,3 ) 
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CR(Fm) 



Fig. 6. The initial layout of CR{Fi). 



Fig. 7. The layout of CR{Fi) and 
R'. 



Example. Figure IBI(a) shows the initial layout of R{E) for an expression E = 
{x\ V a;2 V x^) A (x2 V ^ V A V ail V a;2)- This corresponds to the truth 
assignment a;i = a;2 = 0:3 = 0:4 = true, which does not satisfy E and requires 
Wx{ttr) = 108m + 8r — 58 and Wy^nn) = 8mr + 6m + 4r. The expression E is 
satisfied by the truth assignment a;i = 3:2 = true,X 3 = Xi = false. Figure |B(b) 
shows the corresponding layout. In this case, the width is reduced to Wx{ttr) = 
108m + 8r — 62. 

4.2 Proof 

We show the reducibility of 3-SAT to ILADR. 

Lemma 1. E is satisfiable if and only if there exists a layout of R{E), 

such that it satisfies the constraints (1) and (2), and 114, < 108m+8r— 60, 

Wy(7T^(^)) = 8mr + 6m + 4r. 

Proof. (=4>) We define as the minimum width layout satisfying the following. 
Assuming that E is satisfiable, there is a truth assignment that satisfies E. First, 
we place each VR{xk) as Fig. El^a) if Xk = true, or as Fig. Eljb) if Xk = false. 
In either case, Wxi'^y jR^xk)^ ~ ^ R{xk)^ ~ hold. The 

layout 7T^, is the same as its initial layout in Fig. El where there is no gap between 
rectangles in the y-direction. 

Let yij be Xk or x^. Since each rectangle in LR{yij) except for vsij has the 
same y-coordinate as some rectangle in Rf, and vsij has the same y-coordinate 
as vck,i, ds of LR{yij) is 5 if Sfe = true and ds is 3 if Xfc = false. We place LR(yij) 
as Fig. El a) or Fig. Ha) if ds = 5, and as Fig. Hb) or Fig. m if ds = 3. From 
Figs. El and El we find = 10 if y^j is true, and = 12 

if yij is false. 

By the hypothesis, at least one literal in each Ei is true. Therefore, 
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, 108m-¥8r-58 

X2 X3 




(a) The initial layout {xi = X2 = X3 = X4 = true) 




(b) The adjusted layout (xi = X2 = true, X3 = X4 = false) 



Fig. 8. An example of the layout of R{E). 



hold. That is, < 8r + 2 + (108m — 62) = 108m + 8r — 60, and 

^vWr(e)) = ^yWvR(xu)) = 8mr + 6m + 4r hold. 

(<1=) Assume that there exists a layout of R{E) with Wfc ) < 108m+ 
8r — 60 and Wy{-K'^^^-^) = 8mr + 6m + 4r. We show that there exists a truth 
assignment that satisfies E. 

If a clause Ei has both Xk and Xk, Ei is true for any assignment. In the 
following, we consider a truth assignment for clauses that consists of three literals 
relevant to distinct variables. For a clause Ei, let each yij{j = 1,2,3) be Xkj or 
xEf- The sum of the widths of all VR{xkj) and all LR'(yij) in is 
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^xWr(^E))-{ 51 "^x{T^VR(x^))+W^{'^R:) + {W^{TTyE_.)) + {Wa,{'K„ri,^))} 

k^ki,k2^k3 

< (108m - 8r - 60)-{8(r-3) + 2 + 36(m+*-2) + 36(2m-i-l) + 12} = 58. 

Therefore, for some j, the sum of the widths of VR{xkj) and LR'{yij) is 19 or 
less. Because of > 8 and .)) > 10, < 9 

and Wx{TTRR,(^y. .^) < 11 hold. Since, the hight of the whole layout is 8mr + 6m + 
4r and the initial layout restricts > 8mr+6m+4r, Wy = 

8mr+6TO+4r holds. From WxiTTyj^f^^^ s^) < 9,m rectangles vck^,i, • • • , vckj,m are 

placed in tt^ without any gap in the y-direction. In this case, all the y-coordinates 
of m rectangles are the same as either Fig. |3a) or Fig. ETb). 

We also find that Wy(7r^,) = 8mr + 6m + 4r. This implies that the rectangles 
in R' and in LR{yij) except for vsij do not change their y-coordinates from the 
initial layout. Therefore, Wy{TTRR(^y^ .)) = 8 holds. 

Now, we consider a partial truth assignment that assigns true to Xk if all of 
vck,i, ■ ■ ■ ,vck,m have the same y-coordinates as Fig. 0(a), and assigns false to 
Xk if they have the same y-coordinates as Fig. 0b). We do not care any other 
variables. Since vsij has the same y-coordinates as vck,i, dg = 5 holds for yij if 
we assign true to Xkj, and dg = 3 holds if we assign false to Xkj ■ (Figs. 0and0). 
If 4 = 5 and y^j = x]T, then VFj;(7r^^(^. ^^) > 12 and Wx{TrfR,(^y. ^.)) > 12 hold. 
If ds = 3 and yij = Xk, then Wx{TrfR,(^y. j) > 12 hold. Therefore, because of 
^xiT^LR'iyi ■)) ^ llj Uij = Xkj holds in the case of dg = 5, and yij = xT^ holds in 
the case of dg = 3. In either case, y^j is true. That is there is a truth assignment 
that satisfies at least one literal in each clause, that is, E is satisfiable. q 

We construct R*{E) by adding a rectangle {32mr + 8r, 4) at the left side of 
R{E) so that the upper boundary of this rectangle and R{E) are the same. We 
show that 3-SAT can be reduced into ILADR using R*{E). 

Lemma 2. E is satisfiable if and only if there exists a layout 7r^.(^p where 
'^'r*{e) satisfy the constraints (1) and (2), and <S'(7r^.^^^) < (32mr -|- 108m -|- 
16r — 60)(8mr -|- 6m -I- 4r). 

Proof. If E is satisfiable, from Lemma 0 '^'r(e) be constructed so that 
^('^R*{E)) — (32mr -|- 108m -|- 16r — 60)(8mr -|- 6m -I- 4r). Let S be (32mr -|- 
108m-|-16r — 60)(8mr-|-6m-|-4r). We show that if is satisfiable if S{ttr*(^e)) < S. 
From the definition of R*{E), Wa:(7r^.(£;)) > 32mr -|- 108m -I- 16r — 64 and 
bbj/ ( tt^j. (£.) ) > 8mr -|- 6m -|- 4r. When bF?/ (^^ ) > 8mr + 6m -|- 4r, 

^i'^R'‘(E)) ^ (32mr -|- 108m -|- 16r — 64)(8mr -|- 6m -|- 4r -|- 1) 

= S' -I- 84m — 64. 

From m > 1, 84m — 64 > 0 holds. 

Therefore, bFy(7r)j.^£.^) = 8mr -|- 6m -I- 4r and bFc (tt^j. ^^^ ) < 32mr -I- 108m-|- 
16r— 60 if S(7 t)j.,^j) < S. In this case, = 8mr+6m+Ar, ) < 

108m -I- 8r — 60 hold, and from Lemma 0 E is satisfiable. m 
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Since ILADR is in NP, we obtain the following theorem. 
Theorem 1. ILADR is NP-complete. 



5 A Layout Adjustment Algorithm 

Misue et al. proposed PFS (Push Force-Scan Algorithm) in which is a heuris- 
tic algorithm to find the minimum area adjusted layout under the constraints 
that intersections of rectangles should be avoided and the orthogonal order of 
rectangles should be preserved, for a given rectangle set and its layout. In this 
section, we show a new heuristic algorithm PFS' based on PFS. This algorithm 
obtains an adjusted layout with smaller area than PFS. 



5.1 Push Force-Scan Algorithm 



An algorithm PFS uses a measure called a force to avoid intersections between 
rectangles. The force is a vector defined for each pair of rectangles. The force 
fij for rectangles Vi and Vj is used in the way that if two rectangles intersect 
then fij pushes Vj away from Vi. The direction is chosen by experience not only 
to make vi and Vj disjoint but to keep the layout as compact as possible and to 
preserve the orthogonal order. 

We define a force and other terminologies, and briefly introduce PFS. For a 
given rectangle set R and its layout tt/j, let {xi^yf) denote a coordinate of the 
center of a rectangle Ui(G i?), that is, 'XR{vi) = {xi,yi). Differences Axij and 
Ayij of coordinates between Vi and vj are defined as follows. 

Ax^j = Xj - Xi, Ayij = yj - y^ 



Two different rectangles Vi and vj intersect each other if the following condition 
holds. 



\Axij I 



< 



+ Wj 
2 



and 






hi -t- hj 
2 



Let L be the line from the vfs center to the Vj’s center. Consider that we move Vj 
along L to the point where Vj touches vt without intersections and preserving the 
orthogonal order. A force ftj = {ffj,fij) is defined as the vector from (xi,yi) 
to that point. Let gij be the gradient of L, that is, gij = Ayij/Axij (gij = oo 
if Axij = 0). Let Gij be {hi + hj)/{wi + Wj). 



a) The case where Vi and Vj touch with y-direction boundaries, that is, the case 
of Gi^j P 9i,j ^ dj ^ d 01" 9i,j — d. 






- Jlj = flj- 9^,0 
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[Algorithm Horizontal-PFS] 
begin 

i 1 ; 

while {i < n) do begin 

k ■.= max{j\xi = Xj}; / *Xi = Xi+i = ■ ■ ■ = Xk */ 
5 max(0, max ^); 

for j k 1 to n do 

Xj Xj + 5; 

i k + 

end; 

end. 



Fig. 9. Algorithm Horizontal-PFS. 




(a) An initial layout (b) An adjusted layout byPFS (c) An adjusted layout byPFS' 



Fig. 10. An example of PFS and PFS' (1). 



b) The case where Vi and Vj touch with a;-direction boundaries, that is, the case 
of ^ ^ or ( Gij > 9'i,j') ^ {9'i:j ^ 



fy 

J i. 



f hi + hj 

I 2 






Now, we introduce PFS. PFS finds the adjusted layout satisfying the con- 
straints in 0(n^)-time (n = |i?|). PFS applies forces in the a;-direction first, 
then in the y-direction. First one is called Horizontal-PFS and the other is called 
Vertical-PFS. Vertical-PFS is the same as Horizontal-PFS except for the applied 
direction. Therefore, we present Horizontal-PFS only. 

Horizontal-PFS is shown in Fig. El Assume that xi < X 2 < ••• < Xn- 
Horizontal-PFS decides ^-coordinates of rectangles in the order vi, - ■ ■ ,Vn- The 
rectangles with the same initial x-coordinate are decided at the same time. 
When it decides the a;-coordinates for Vi, - ■ ■ ,Vk, it also moves all the rectan- 
gles Vm{i < m < k) and Vj{k < j < n) by the same distance in the a;-direction. 
This distance depends on Vj{k < j < n) as well as < nr < fc). 

PFS restricts the movement only to the positive direction. Misue et al. also 
proposed another algorithm, Push-Pull Force-Scan algorithm, which allows the 
movement in the negative direction. This algorithm does not always guarantee 
the disjointness. Therefore, we do not deal with it. 



5.2 The Improvement of PFS 

In some case, PFS is not efficient. We now consider the case in Fig. [HU Fig- 
ure EUa) shows an initial layout, and Fig. nTil hl shows its adjusted layout by 
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[Algorithm Horizontal-PFS^] 
begin 

i 1 ; 

(T 0; 

Imin 1; 

while {i < n) do begin 

k max{j|a:i — Xj}; /* Xi — = • • • = Xk */ 

7 0; 

if (xi > xi) then 

for m i to k do begin 

■y" max {jj + /_,%); 
i<j<i 

1 — 

( (T if Lbnd{Vm,Xm) + l" < l-bnd(vimin, Xlmin) 

[ y" otherwise 

y max(7, 7 ); 

end; 

for m i to k do begin 

7m — 7 ; 

Xjn Xm + 7m; 

if Lbnd{vm,Xm) < Lbnd{vimin,ximin) then 
Imin m; 

end; 

(7 := a + max(0, max fm 

i<m<k<.j<n ’ 

i k + 

end; 

end. 



Fig. 11. Algorithm Horizontal-PFS'. 



PFS. In this case, first, V2 and are moved to the right by /f 3, and then is 
moved by /fa again. Therefore, a needless gap appears between v\ and V2- 

Here, we propose an algorithm PFS', which obtains an adjusted layout with 
smaller area than the layout obtained by PFS. PFS' has the same time com- 
plexity 0 {v?) as PFS'. Similarly to PFS, PFS' executes Horizontal-PFS' and 
then Vertical-PFS'. We show only Horizontal-PFS'. 

Again, we consider the example in Fig. In this case, it is sufficient for 
V2 to be moved by /f ^ and for U3 to be moved by max{/f 2 + /f.3) /ysl- PFS' 
generalizes this idea. Assume that x\ < X2 < ■ ■ ■ < Xn- Horizontal-PFS' is shown 
in Fig. El A function Lbnd{vi,Xi) is the x-coordinate of the left boundary of 
Vi when the x-coordinate of vi is Xi. Horizontal-PFS' decides a;-coordinates of 
rectangles in the order vi, ■ ■ ■ ,Vn, where the rectangles with the same initial 
x-coordinates are decided at the same time. When it decides the ^-coordinate 
for Vi, - ■■ ,Vk, the movement distance depends only vi, - ■ ■ ,Vk except for some 
special case. This is different from PFS. We explain how to decide x-coordinates 
of Vi, - ■ ■ , Vk- Assume that a;-coordinates of ui, • • • , Vi-i have been decided. Let 

be the distance by which Vj is moved by PFS' in the x-direction. Except for 
the special case mentioned later, Horizontal-PFS' decides 7m (* < m < k) as the 
maximum value of -I- /f for 1 < j < i and i < m < k. 

The exception is as follows. Let am be the distance by which Vm{i ^ m < k) 
is moved to the right in PFS. The movement 7m may place some Vm so that the 
left boundary of Vm is farther left than any other rectangles whose a;-coordinates 
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have been decided. In this case, the area may become larger than PFS. To avoid 
this, we decide the movement distance as Um instead of 7^ in this case. 



5.3 The Validity of the Algorithm 

We prove that the area of the layout by PFS' is not larger than one by PFS, 
and that the layout by PFS' satisfies the constraints (1) and (2). 

Let 7T^ and tt^ be the layout of R by PFS and PFS', respectively. Let Xi, 
x\ and x" be a;-coordinates of Vi in the initial layout, tt^ and tt^, respectively 
(1 < i < n). Let ai and 7^ be the distance by which PFS and PFS' moves 
Vi in the x-direction, respectively. PFS calculates ai as follows, where I is the 
minimum m satisfying Xi = Xm- 



O’* — <^o + <5i + • • • + Si-i 

{ 0 if t = 0 or Xi = Xi+i 

max(0, max ■) if Xi < Xi+i 

l<m<i<j<n 

PFS' uses the following 7'' and 7' to calculate 7i, where I is the minimum m 
satisfying Xi = Xm, and 



7" = max (7j + /L) 



i<j<i 



/ 



= 



ai if Lbnd{vi, Xi) + 7" < min Lbnd{vj,Xj + jj) 

i<i 

7" otherwise 



7, = max 7^ 



Xi—Xm 



Lemma 3 . For all i{l < i < n), (a) ai > 7", and (b) x' > x" hold. 

Proof. For all i, we show ai > 7" and x' > x" by induction. For all i such that 
xi = Xi, ai = 7i' = 7i = 0 hold. Therefore, xi = Xi + di = Xi + 7i = x" holds. Let 
X/, • • • ,Xfc be the maximal sequence with the same x-coordinate. Assume that 
x'j > x", that is aj > 7j, for j < 1 . For all i such that I < i < k, ^i = and 
di = d; hold. Therefore, it is sufficient to show d/ > 7; for di > 7i and then 
x'i > x”{l < i < k). For I < i < k, is calculated as follows. 

7" = max (7j- + fP) < max (d^- + /L) 



Let Ij and kj be the minimum and the maximum indices such that xq = Xj and 
Xkj = Xj hold, respectively. 



fl,n < , max //, 

</Cj<m <n 

< max(0, max fp ,) = 4, 



Because of aj < d^^, then we show di > 7" for I < z < A:. 
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7" < max (cTj + /f J < max {aj + Sk.) < max (crfe. + 4, ) < CTfc.+i <<Ji^(Ji 
i<j<r i<j<i ^ i<j<i 



From Gi > 7", Gi > 7' holds. Because of ct; = • • • = Ufe, 7i = max 7^ < 

l<m<k 

max (Tm = cTi. Therefore, a;' > x" holds. □ 



Lemma 4. and tt^ satisfy the following conditions. 

Wfr(Trfl) < V4«) and Wy(7ry < Wy(7r^) 

Proof. We only prove Wa,(7r^) < 114 Let I' be the smallest index among 
the rectangles whose left boundaries are the left boundary of tt^. Let r' be the 
smallest index among the rectangles whose right boundaries are the right bound- 
ary of 7T^. We define I" and r” for tt^ similarly to I' and r' for tt^, respectively. 
It is sufficient to prove that 

left^,{vn) < left^„{vri) and right^,{vr') > right^,, {vr") . 

If xi'i = xi, then 7/// = gi>i = 0. In this case, left^,{vn) < left^,{vi") = 
left^„{vi'i) hold. Consider the case where x\ yf xin. The rectangle vin is the 
widest among the rectangles vi ^„ , • • • , Vk^,, with the same x-coordinate. Let Imin" 
be the value of a variable Imin after PFS' decided di for i = 1, • • • , kn — 1. Since 
left^„{vi») < left^„{virmn"), PFS' fmds Lbnd{vn' ,xi»ypfl'„ < Lbnd{vimin" , ximin" 
+ limin")- sets 7/» = (Tz». This implies left^,{vi>) < left.^,{vw) = left^„{vi»). 

From Lemma El right^,{vr") > rightj^„{vr») holds. Therefore, right.^,{vr>) > 
rightj^,{vr") > right.^ii{vr") holds. q 

Next, we show that the adjusted layout by PFS' satisfies the constraints. 

Lemma 5. For any two rectangles Vi,Vj G R{i < j), — 7i > ffj holds. 

Proof. In the case of Xi = Xj, 7i = 7y and ffj = 0, 7^ — 7i > /A holds. Consider 
the case of Xi < Xj. Let I and k be the minimum and maximum indices such 
that xi = Xj and Xk = Xj , respectively. For all m such that I < m < k, 

7m = max (7i/ -b ff,^^) > 7i + fij- 

From Lemma El 7m 4 holds, and moreover, 7^ = max 7^ and 7^ = g^ or 

l<m<k 

7^ holds. We find 7^ < 7j for ^ < m < fc. Therefore, 7^ > 7i -b ffj holds. q 

Lemma 6. The algorithm PFS' preserves the orthogonal order of the initial 
layout (the constraint ( 1 )). 

Proof. If Xi = Xj, then x'f = x" holds. Consider the case Xi Xj. Assume 
Xi < Xj w.l.o.g. By Lemma 0 and the definition of ffj, — 7i > ffj and 
Xi < Xj + ffj hold. Therefore, x” = Xi+^i < x^+jj-ffj < Xj+^j = x" holds. □ 
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(b) (the layout by PFS) 




(c) (the layout by PFS') 



Fig. 12. An example of PFS and PFS' (2). 



We can see that the layout by PFS' satisfies the constraint (2) from Lemma0 
and the definition of /f^. Then we have the following lemma and theorem. 

Lemma 7. Algorithm PFS' guarantees the disjointness of rectangles (the con- 
straint (2)). 



Theorem 2. PFS' adjusts the layout in 0(nf) time, and the result satisfies the 
constraints (1) and (2) and the area is smaller than the result o/PFS. 

Example. Fig. 1 1 /I illustrates an example of applying PFS and PFS' for a given 
set R and its layout tt/j. In this case, PFS'(Fig. IT^ cii obtains much smaller 
area than PFSlFig. IT^' hlV 

6 Conclusion 

We considered the layout adjustment problem for minimizing the area under the 
constraints that intersections of rectangles should be avoided and the orthogo- 
nal order of rectangles should be preserved, and showed that the corresponding 
decision problem on the integral coordinate system is NP-complete. We also pro- 
posed a heuristic algorithm for this problem applicable to both (on the integral 
and real coordinate system). Our algorithm obtained smaller area than the al- 
gorithm proposed by Misue et ah, while both algorithms have the same time 
complexity. 

It would be interesting to find NP-completeness of layout adjustment prob- 
lems that guarantee different constraints, and to provide much better heuristic 
algorithms. 



References 

1. G. Di Battista, P. Fades, R. Tamassia, and I. G. Tollis, “Algorithms for drawing 
graphs: an annotated bibliography,” Computational Geometry Theory and Appli- 
cations, vol. 4, pp. 235-282, 1994. 



A Layout Adjustment Problem for Disjoint Rectangles 197 



2. R. Tamassia, G. Di Battista, and C. Batini, “Automatic graph drawing and read- 
ability of diagrams,” IEEE Trans, on System, Man and Cybernetics, vol. 18, no. 1, 
pp. 61-79, 1988. 

3. P. Eades, W. Lai, K. Misue, and K. Sugiyama, “Preserving the mental map of a 
diagram,” Proc. of COMPUGRAPHICS ’91, pp. 34-43, 1991. 

4. K. Misue, P. Eades, W. Lai, and K. Sugiyama, “Layout adjustment and the mental 
map,” Journal of Visual Languages and Computing, vol. 6, pp. 183-210, 1995. 

5. M-A. D. Storey and H. A. Muller, “Graph layout adjustment strategies,” Proc. 
Graph Drawing ’95, LNCS 1027, pp. 487-499. Springer- Verlag, 1995. 

6. M. R. Garey and D. S. Johnson, “Computers and Intractability — A Guide to the 
Theory of NP-Completeness,” W. H. Freeman and Company, New York, 1979. 




Drawing Algorithms for Series-Parallel Digraphs 
in Two and Three Dimensions* 



Seok-Hee Hong^, Peter Eades^, Aaron Quigley^, and Sang-Ho Lee^ 

^ Department of Computer Science and Engineering, 

Ewha Womans University, Korea. 

{shhong, shlee}@cs . ewha. ac .kr 
^ Department of Computer Science and Software Engineering, 
University of Newcastle, Australia. 

{eades , aquigley}@cs .newcastle . edu. au 



1 Introduction 

Series parallel digraphs are one of the most common types of graphs: they appear 
in flow diagrams, dependency charts, and in PERT networks. Algorithms for 
drawing series parallel digraphs have appeared in 

In this paper we describe algorithms which can draw series parallel digraphs 
in two and three dimensions. Sample drawings are in Figure^ Specific variations 
of the algorithms can be used to obtain symmetric drawings, or drawings in which 
the “footprint” (that is, the projection in the xy plane) is minimized. 

This extended abstract is organized as follows. In the next section, we sum- 
marize the necessary background for series parallel digraphs. Then concepts for 
symmetric drawings, especially with respect to series parallel digraphs, are pre- 
sented in Section 0 The two dimensional algorithm is given in Section building 
on the two dimensional algorithm, the three dimensional algorithm is given in 
Section 0 

2 Series Parallel Digraphs 

First we review some of the fundamental notions for series parallel digraphs. A 
digraph consisting of two vertices u and v joined by a single edge is a series 
parallel digraph, and if Gi and G2 are series parallel digraphs, then so are the 
digraphs constructed by each of the following operations: 

— series composition: identify the sink of Gi with the source of C?2- 

— parallel composition: identify the source of Gi with the source of G2 and the 
sink of Gi with the sink of G2. 

* This is an extended abstract. This research has been supported by an Aus- 
tralian Research Council Grant, KOSEF No. 971-0907-045-1, and the SCARE 
project at the University of Limerick. Note that the three dimensional draw- 
ings in this paper are static. Animated drawings are available from A. Quigley. 
http://www.cs.newcastle.edu.au/~aquigley. This paper was partially written 
when the first author was visiting the University of Newcastle. 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 198-|2nSI 1998- 
(c) Springer- Verlag Berlin Heidelberg 1998 
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Fig. 1. Drawings output by the algorithms described in this paper. 



The subgraphs Gi and G 2 are components of G. Series parallel digraphs may 
be represented as decomposition trees in!> such as in Figure Ela). Leaf nodes 
in the tree represent edges in the series parallel digraph, and internal nodes are 
labeled S' or P to represent series or parallel compositions. 

Because parallel composition is commutative and both series and parallel 
compositions are associative, there may be more than one binary decomposition 
tree for a series parallel digraph. This means that an algorithm based on the 
binary decomposition tree cannot fully display symmetries. To overcome this we 
construct the structure tree (sometimes called a canonical decomposition tree) in 
which the same composition operations are placed on the same level. Figure|21(b) 
shows the structure tree corresponding to Figure Ela). The structure tree can be 
computed in linear time using the algorithm of Valdes et al. USED followed by 
a simple depth-first-search restructuring operation. The structure tree is unique 
up to the ordering of siblings. 

The A-algorithm 00] draws series parallel digraphs. The algorithm produces 
grid drawings with straight-line edges. It has been claimed that this algorithm 
can be varied to display symmetries. However, at best, the Z\-algorithm displays a 
subset of the set of possible symmetries. The method in Section EJbelow displays 
all possible symmetries. 

3 Symmetric Drawings in Two Dimensions 

To ensure that all possible symmetries are displayed, it is important to use a 
rigorous model for the intuitive concept of symmetry display. In this section we 
describe such a model, derived from those introduced by Manning mm and 
Lin m. 

Symmetries of a graph drawing correspond to automorphisms of the graph. 
However, some automorphisms cannot be displayed as symmetries of any graph 
layout. Further, it is possible to have two automorphisms, each of which can be 
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displayed, but for which there is no drawing which displays both. See |4lt)| for 
examples. For these reasons, we define “geometric automorphism group” in the 
following section, and indicate how this notion relates to symmetry groups of 
graph drawings. 






(a) Decomposition tree 



Fig. 2. (a) A binary decomposition tree, and (b) the equivalent structure tree. 
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3.1 Geometric Automorphisms of Graphs and Symmetries of 
Graph Drawings 

We need some of the terminology of permutation groups; for more details see HHI. 
We denote the identity permutation by I. The group generated by ai, 02 , . . . , Ofc 
is denoted by < ai, 02 , . . . , >. If a permutation p acting on a set V has a fixed 

element v € V, that is, p{v) = v, then p induces a permutation pyOnV — {f }. A 
permutation group P is semiregular if each non-identity permutation in P does 
not have a fixed element. 

A permutation p on y is a rotational permutation if either < p > or < py > 
(for some v G y) is semiregular, and | < p > | > 1. Note that a rotational 
permutation has at most one fixed element. A permutation p on y is an axial 
permutation if p^ = I and p is a non-identity permutation. A permutation 
is a geometric permutation if it is either an axial permutation or a rotational 
permutation. 

A permutation group P on y is geometric if it is one of the following types: 

1. P =< q > where q is an axial permutation; or 

2. P =< p > where p is a rotational permutation; or 

3. P =< p,q > such that: 

(a) p is a rotational permutation and q is an axial permutation, and 

(b) < p > n < g >= {/}, and 

(c) qp = p-^q. 

A subgroup P of the automorphism group of a graph G is geometric if P is a 
geometric permutation group on V. 

Next we consider graph drawings. The symmetries of a bounded set of points 
in the plane (such as a two dimensional graph drawing) form a group called 
the symmetry group of the set. A symmetry a of a drawing P of a graph G 
induces an automorphism p of G if the restriction of a to the points representing 
vertices of G is p. A drawing P of a graph G displays a geometric automorphism 
p of G if there is symmetry a of P which induces p; P displays a geometric 
automorphism group P of a G if P displays every element of P. It is easy to 
see that the symmetry group of a graph drawing induces a geometric subgroup 
of the automorphism group of the graph. The converse was proved by Lin p] 
and Manning m- for every geometric automorphism group P of a graph G, 
there is a drawing P of G which displays P. In Section E] we show that for every 
geometric automorphism group P of a series parallel digraph G, there is a planar 
drawing P of G, of the type illustrated in Figure Q which displays P. To apply 
this result, however, we must compute geometric automorphism groups of series 
parallel digraphs. In general, the problem of finding a geometric automorphism 
of a graph is NP-hard |'/l 1 1 )j : it may be strictly harder than the problem of 
finding the automorphisms of graphs in general (which is merely isomorphism 
hard mi)- The next section shows that for the case of series-parallel digraphs, 
finding geometric automorphisms is not difficult. 
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3.2 Geometric Automorphisms of Series Parallel Digraphs 

In this section we sketch an algorithm which finds geometric automorphisms 
for series parallel digraphs. The geometric automorphism group obtained in this 
way is used explicitly to draw the graph symmetrically. 

For the purposes of this paper, an automorphism of a digraph either main- 
tains the direction of all directed edges, or reverses all directed edges. Thus such 
an automorphism maps a source to either a sink or a source; further, it maps 
cut vertices to cut vertices. This leads directly to the next lemma. 

Lemma 1. The automorphism group of a series parallel digraph contains at 
most two axial geometric automorphisms and at most one rotational geometric 
automorphism (which must have degree 2.) 

Proof. Omitted. 

An important consequence of this Lemma is that for series parallel graphs, 
there is a single maximal geometric subgroup of the automorphism group. Fur- 
ther, we can derive a method for finding all geometric automorphisms of a series 
parallel digraph. Roughly speaking, the method proceeds as follows. 

1. We construct the structure tree mM- 

2. We label the structure tree. The labeling is canonical, in the sense that 
isomorphic blocks have equal labels. The labeling can be computed in linear 
time by adapting the tree isomorphism algorithm mni. This labeling step 
is the critical part of the algorithm. 

3. We check for the existence of each of the geometric automorphisms men- 
tioned in Lemma ^ 

The complete algorithm can be implemented in linear time; details will appear 
in the full version of this paper. 

4 The Two Dimensional Drawing Algorithm 

First we describe a simple procedure for giving a visibility representation |12| of a 
series parallel digraph G. In the representation that we construct, the horizontal 
line segment for the source is a vertical translation of the horizontal line segment 
of the sink. 

For a graph which consists of single edge, such a representation is simple. 
Suppose that Di and D 2 are visibility representations of series parallel digraphs 
Gi and G 2 respectively. If G is a series composition of Gi and G 2 , then we can 
construct a representation D of G by “stretching” the narrower of Di and D 2 
and identifying the source of one with the sink of the other; see Figure 01^ a). If G 
is a parallel composition of Gi and G 2 , then we can construct a representation 
D of G by “stretching” the shorter of D\ and D 2 and identifying their sources 
and sinks; see Figure Eljb). 

Two traversals of the structure tree can be used to compute the visibility 
representation. One traversal computes the size of the enclosing rectangle for 
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Fig. 3. Constructing visibility representations of series parallel digraphs. 



each component, the next computes the route for each edge. This works in linear 
time. The details will appear in the full version of this paper. 

This visibility representation can be transformed to a quasi-orthogonal draw- 
ing in a simple way; Figure^ shows a quasi-orthogonal drawing obtained from 
Figure 0b). Note that the source and the sink share a vertical line; this is 
important in Section |3 for drawing in three dimensions. 




Fig. 4. Quasi-orthogonal drawing of a series parallel digraph. 



^ Strictly speaking, this is not an orthogonal drawing, since the edges overlap. The 
precise dehnition of this class of drawing is involved and we will omit it in this 
extended abstract. 
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Our algorithm places parallel components across the page in the same order 
that they appear in the structure tree. To display symmetry, we need to order the 
children of each node corresponding to a parallel composition in the structure 
tree before applying the drawing algorithm. For example, Figure EKa) has no 
symmetry; Figure0(b) is a symmetric drawing of the same graph. The difference 
between Figure Ea) and (b) is the left-right order of components of parallel 
compositions. It can be shown that the order can be chosen to display the any 
geometric automorphism group of a series parallel digraph. 





Fig. 5. Symmetric and asymmetric drawings of a series parallel digraph. 



Theorem 1. Then there is a linear time algorithm which constructs quasi- 
orthogonal drawings of series parallel digraphs such that the output is planar, 
and displays every geometric automorphism of the input. 

Proof. The algorithm uses the same labeling technique used for computing geo- 
metric automorphisms. Details are omitted in this extended abstract. 

The drawings obtained by this algorithm are not grid drawings. However, 
they do have good area bounds, in the sense that if the minimum distance 
between a pair of vertices is one, then the drawing is 0{n) x 0{n). It is possible to 
vary the algorithm to give straight-line drawings. However, note that a straight- 
line drawing may require exponential area (see | 2 |). 

5 Drawing Series Parallel Digraphs in Three Dimensions 

In this section we present an algorithm for producing three dimensional drawings 
of series parallel digraphs. The drawings improve on the resolution of the two 
dimensional drawings. Note that as long as we keep to the rule that the minimum 
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distance between a pair of vertices is one, improvements to resolution can be 
obtained by reducing the extent of the drawing in each dimension. 

Consider the drawing in Figure E| obtained from the algorithm described 
in the previous section. Suppose that this drawing is in the xz plane within a 




three dimensional space, with the z axis vertical on the page. For each parallel 
node V in the structure tree, the children of v are aligned with the x axis. Note 
the source and sink of each component share a vertical line. We can rotate a 
component about this line so that it aligns with the y axis; this is illustrated in 
Figure [71 

The rotation is the basic operation used to improve resolution. For each 
parallel node v in the structure tree, we can choose to align the children of v 
either in the x direction or in the y direction. Such a choice is illustrated in 
Figure 0 The result of these choices is illustrated in Figure 0 

The z extent of the three dimensional drawing is fixed by the height of the 
structure tree; we concentrate on reducing the x and y extents. The footprint 
of a three dimensional graph drawing is the projection of the drawing in the xy 
plane. To improve the resolution, we need to reduce the size of the footprint. If 
the minimum enclosing rectangle R of the footprint has dimensions X xY, then 
the size of the footprint is max(AT, Y). 

In fact, we can find a choice of a; or y alignment for each parallel composition 
in such a way that it minimizes the size of the footprint. We use a dynamic 
programming approach, along the lines of methods for drawing two dimensional 
“hv-trees” |S|. 
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Fig. 7. Three dimensional drawing obtained by executing the rotation indicated 
in Figure\^ 




Fig. 8. Choices of x or y alignment for nodes in the structure tree. 



We say that a layout is minimal if its footprint has size X x F, and there 
is no layout with footprint of size X' x Y' where X' < X, Y' < F, and 
{X',Y') yf (X,Y). A layout with a (globally) minimum size footprint is among 
those of minimal footprint. There may be many minimal layouts of a series par- 
allel digraph. The algorithm computes all minimal layouts, and chooses one with 
a minimum size footprint. The algorithm proceeds from the leaves of the struc- 
ture tree to the root: at each internal node it computes the minimal layouts of 
that component. 

The footprint of a leaf in the structure tree (that is, an edge in the graph) 
has dimensions 1x1. Minimal layouts for a component represented by a node ly 
in the structure tree can be computed from minimal layouts of the components 
represented by the children of as follows. 
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Fig. 9. The three dimensional drawing resulting from the choices in Figure\^ 

Suppose that is a node in the structure tree with children fii, fi 2 , ■ ■ ■ , Fk, 
and the footprint of fSi has dimensions x for 1 < i < A:. 

First suppose that v represents a series composition. Then the footprint for 
V has size X x Y where 



X = max(A:i,X2, . . -,Xk), 

y = max(ri,y2,...,n). 

This is illustrated in Figure uni As mentioned previously, each component may 
have many minimal layouts. We store all the minimal layouts for each child fj,i 
of as a list 

= {{xiX), . . . , y™')) 




Fig. 10. The footprint of a series composition. 
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of pairs such that is decreasing in X coordinate. Note that since each element 
is minimal, is increasing in Y coordinate. A list of minimal layouts for v 
can be computed from , . ■ . , i/i*. using a kind of merge algorithm below. 

Here (C^, Cy) is a candidate for a minimal footprint layout for v, and pi is the 
pointer to the current element of list . 

1. Choose £ such that Xj is maximized; (Cx,Cy) = {Xj,Yl). 

2. Ly={{C^,Cy)). 

3. For i = 1,2, ... ,k, Pi = 1. 

4 . Pi = 2 . 

5. While Pi < rui for each i: 

(a) C,, = max ,Xp^ , . . . ,XP<^). 

(b) Cy = max(yp^yp^...,yp''). 

(c) Suppose that {LASTj,, LASTy) was the last element appended to L,^. 

If LAST,, < C„ and LASTy > Cy 

then append {Cx,Cy) to 

else replace {LAST^, LASTy) by {Cx,Cy) in L^.. 

(d) Choose i such that X^ is maximized. 

(e) pi=pi + 1. 

Step 5(c) ensures that the elements of L,, are minimal. The choice of £ at step 
5(d) can be done using an indexed priority queue; this has amortised constant 
time per access. Thus the algorithm takes time proportional to the sum of the 
lengths of the input lists. 

In the case that v represents a parallel composition, we need to choose 
whether to align the children in the x direction or in the y direction. An x 
alignment for v has dimensions X x Y where 

X = Xi + X 2 + • • • + Afc, 

Y = max(Yi,y2, • • ■ ,dfc), 

and a y alignment for v has dimensions X xY where 

A = max(Ai, A 2 , . . .,Xk), 

Y = Yi+Y2 + --- + Yu. 



One can use these equations with merge operations in a similar but more com- 
plex way to the method for a series composition (see 0) to compute all minimal 
footprint layouts for the graph. The details are omitted for this extended ab- 
stract. 

The complete algorithm works in time 0{n^). 

Theorem 2. There is an algorithm which computes a minimum size footprint 
layout of a series parallel digraph in time Ofnf). 



Proof. Omitted. 
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6 Conclusion 

In this paper we have introduced two algorithms for drawing series parallel 
digraphs. One constructs two dimensional drawings which display symmetries, 
the other constructs three dimensional drawings with a footprint of minimum 
size. 

Future work will include combinations of these two algorithms: we would like 
to display as much symmetry as possible in a three dimensional drawing of small 
footprint. 
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Abstract. We address the problem of finding viewpoints that preserve 
the relational structure of a three-dimensional graph drawing under or- 
thographic parallel projection. Previously, algorithms for finding the best 
viewpoints under two natural models of viewpoint “goodness” were pro- 
posed. Unfortunately, the inherent combinatorial complexity of the prob- 
lem makes finding exact solutions is impractical. In this paper, we pro- 
pose two approximation algorithms for the problem, commenting on their 
design, and presenting results on their performance. 



1 Introduction 

Since it was first considered by the graph drawing community [till ()] . there has 
been much research into three-dimensional graph drawing. There is some exper- 
imental evidence that three-dimensional graph drawings have advantages over 
their two-dimensional counterparts. It is claimed m that three dimensions al- 
low users to work with larger graphs - the natural three-dimensional actions of 
rotation and translation allow a user to resolve ambiguities in large drawings 
while maintaining their overall mental map P3- 

Ware et al. m conducted a series of experiments on finding paths between 
vertices in a three-dimensional graph drawing, under a variety of display and nav- 
igation combinations. They discovered that giving users control of bidirectional 
rotation results in lower error rates than continuous unidirectional rotation, but 
at the cost of increased decision time. In [Z|, we conjectured that this increase is 
partly due to the time taken to manually select good viewpoints. 

Most current systems leave the selection of good viewpoints entirely to the 
user. We propose that the user should be able to specify those parts of the 
graph drawing they wish to focus on, then the system should automatically move 
them to a good viewpoint. For interactive applications, the movement needed 
to maintain the illusion of three dimensions |2] can be achieved by continuously 
moving between several viewpoints, or by “rocking” around a single viewpoint. 

In graph drawing, Kamada and Kawai HD model good viewpoints as those 
that preserve the “shape” of a three-dimensional wire-frame drawing, by ex- 
cluding viewpoints from which edges appear collinear. They describe the bad 
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viewpoints of their model as a set of great circles on the unit sphere, and present 
an algorithm to find the best viewpoints of a graph drawing (those viewpoints 
which are furthest by great-circle distance from the nearest bad viewpoint). 

Bose, Gomez, Ramos and Toussaint |2| model good viewpoints for wire-frame 
drawings as those from which no vertex- vertex or vertex-edge pairs occlude each 
other, and no three edges appear to cross at the same point. They describe the 
bad viewpoints of their model as an arrangement of curves on the unit sphere. 

In computer graphics, aspeet graphs HH are used to describe sets of view- 
points from which the two-dimensional images of a three-dimensional polyhedral 
solid have the same topology. Aspect graphs can be used to find viewpoints from 
which the maximal number of faces of a polyhedron are visible; in some sense, 
these viewpoints can be considered “best” . 

In a previous paper 0, we presented three models of good viewpoints: one 
that distinguishes between good and bad viewpoints, and two that assign con- 
tinuous measures of goodness to each viewpoint, leading to the notion of best 
viewpoints. Unfortunately, the complexity of the latter two models is too high 
for the calculation of theoretically-exact best viewpoints to be practical. In this 
paper, we address this problem by proposing fast approximation algorithms that 
yield “reasonably good” viewpoints. 

In Sect. 0 we briefly review the models of good viewpoints detailed in 0. 
In Sect. 0 we consider several criteria which algorithms for finding best view- 
points should satisfy. In Sects. 0 and 0 we present two classes of approximation 
algorithms and discuss their relative merits. We conclude in Sect. 0 with a dis- 
cussion of the experimental performance of these algorithms, and some examples 
of graph drawings viewed from their best viewpoints. 



2 Good Viewpoints 

A three-dimensional straight-line drawing D : V —> IR^ of an abstraet graph 
G = {V,E) associates a three-dimensional position {xi,yi,Zi) G IR^ with each 
vertex Vi € V. Each edge e^- is drawn as a line-segment between its endpoints. 
We use Vo to denote the set of isolated vertices; for a wire-frame drawing, Vq = 0- 

A three-dimensional graph drawing is mapped to a two-dimensional image via 
a projection. In this paper, we consider only orthographic parallel projections 0. 
An orthographic parallel projection is parameterised by its viewpoint direction - 
a vector from the origin in IR^ to a point p on the unit sphere. A two-dimensional 
image is formed by translating each point of the three-dimensional drawing, 
parallel to the vector p, onto a plane (the projection surface) that is perpendicular 
to p. The drawing can be clipped by a volume before projection - those portions 
of the three-dimensional graph drawing outside of this clipping volume do not 
appear in the resulting two-dimensional image. 

If a projection maps two three-dimensional points to the same two-dimen- 
sional point, then an occlusion occurs. We say the front point occludes the rear 
point. The concept of occlusion underlies many models of good viewpoints. 
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2.1 Bad Viewpoint Arrangements 

Definition 21 A good viewpoint is one from which the apparent abstract graph 
of the two-dimensional image is identical to the abstract graph of the three- 
dimensional graph drawing. 

For the purposes of this paper we assume that: a two-dimensional image is 
generated from a three-dimensional graph drawing using an orthographic parallel 
projection; there is no clipping; and all vertices and edges are mathematical 
ideals, with zero width for the purpose of calculating occlusions. Under these 
assumptions, the abstract graphs of a three-dimensional graph drawing and its 
two-dimensional image appear the same, if and only if there are no occlusions 
involving elements of the drawing. Viewpoints that result in occlusions are bad 
viewpoints; b) denotes the set of bad viewpoints from which a occludes b. 

There are four main types of occlusions: vertex-vertex occlusions, vertex- edge 
occlusions, edge-vertex occlusions, and edge-edge occlusions. If Vi and vj are both 
isolated vertices, then tp(vi,Vj) is called an isolated-vertex occlusion. Edge-edge 
occlusions only affect the apparent abstract graph if the corresponding two- 
dimensional edges overlap (intersect at more than one point). 

Observation 22 A good viewpoint is one that does not generate any isolated- 
vertex, vertex-edge, or edge-vertex occlusions. 

The set of bad viewpoints corresponding to a three-dimensional graph draw- 
ing can be represented by a collection 'P of occlusion curves on the unit sphere. 
Using the techniques described in Bose et al. |21, we construct an arrangement P| 
of the elements in W. We refer to this structure as a bad viewpoint arrangement. 

Lemma 23 |5J For a given three-dimensional graph drawing, we can build the 
corresponding bad viewpoint arrangement S in OdiFl log |!F| -|- k) time, where 
\F\ = |Vb|(|Vo| — 1) -I- 2|U||if|. The parameter k is the number of intersections 
between elements ofW, which is 0(|VbP -b |Up|Up) in the worst case. The size 
ofS is 0{\F\ + k). 

2.2 Rotational Separation Diagrams 

A bad viewpoint arrangement allows us to determine whether a given viewpoint 
results in an occlusion that affects the apparent abstract graph of a drawing. 
However, in itself, a bad viewpoint arrangement does not tell us from which 
points it would be best to view the drawing. To address this issue, we have 
developed two measures of goodness over the set of viewpoints. The first of these 
is the rotational separation measure. 

Let S{p,p') denote the great-circle distance between two viewpoints p and p' . 
We define Grsdip,'f’{o.,b)) to be minp/g,^(a i5(p,p')- The rotational separation 
0rsd(p,tf') of p is Grsdip,'f’{a,b)). If this minimum value is achieved 

for ip{a', b'), then we say that if{a', b') determines the goodness of p. A rotational 
separation diagram is a variant of a Voronoi diagram jSj that uses rotational 
separation as its distance function. 
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Lemma 24 A rotational separation diagram can he constructed in 0(|5'| log 151) 
time, where S is the corresponding bad viewpoint arrangement. The resulting 
diagram can he used to calculate ^rsd(p, tf') in logarithmic time. 

A rotational separation diagram can also be used to the find best viewpoints 
of a three-dimensional graph drawing under the rotational separation measure. 
Clearly, the goodness value Grsd{p, S) increases as p moves away from the nearest 
bad viewpoint(s) in S. This increase is maximised locally, either at a Voronoi 
vertex of the rotational separation diagram, or in some cases, at a unique internal 
point of a Voronoi edge. 

Theorem 25 Given a rotational separation diagram, we can find all locally-best 
viewpoints in 0(|5|) time, where S is the corresponding bad viewpoint arrange- 
ment. We can also find a locally-best viewpoint, nearest by great-circle distance 
to a given viewpoint, in logarithmic time. 

Best viewpoints under the rotational separation measure maximise the a- 
mount by which the viewpoint can rotate before an occlusion is generated. This 
can be beneficial for interactive applications, especially those that use “rocking” 
to achieve the illusion of three-dimensions 0. 

2.3 Observed Separation Diagrams 

Our second continuous measure of goodness is observed separation. We define 
Gosd{p,'ficL,b)) to be the minimum distance between the images of graph el- 
ements a and b when viewed from the viewpoint p. The observed separation 
GosdfPi'H^) of p is 0osd(p, '0 (q, ^))- An observed separation diagram is 

a Voronoi diagram that uses observed separation as its distance function. 

A tight bound for the worst case size of an observed separation diagram 
is currently an open problem. An upper bound is known for the 

restricted case of three-dimensional point sets (A = 0), where a is the inverse of 
Ackermann’s function A bad viewpoint arrangement S exists which yields an 
f2(|5p) lower bound; however, it is not yet known whether such an arrangement 
can be derived from a three-dimensional graph drawing. 

Best viewpoints under the observed separation measure maximise the user’s 
ability to resolve elements in the two-dimensional image of a three-dimensional 
drawing. Example drawings (such as those in Fig.|^ suggest that best viewpoints 
under observed separation are superior to those under rotational separation for 
the static display of three-dimensional graph drawings. 

3 Approximate Solutions 

We propose that, for a viewpoint-finding algorithm to be useful, it should satisfy 
these criteria: 

Quality - The viewpoint found should be “equivalent” to a theoretically-exact 
best viewpoint. Exactly what it means for two viewpoints to be “equivalent” 
varies among applications and users. If our output device has finite resolution. 
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then equivalence can be quantified according to the threshold angle below 
which a given three-dimensional point is expected to map to the same pixel 
in both of the resulting two-dimensional images. This threshold angle is 
equivalent to half the minimum angle of rotation 9 that would result in a 
vertex Vi shifting from one edge of a pixel in the two-dimensional projection 
to the opposite edge of the same pixel. Hence, on an n x n pixel display, two 
viewpoints can be considered “equivalent” when the angle between them is 
less than ^ = arcsin - . 

Locality - The viewpoint p' found should be “close” to the user’s specified view- 
point p. This criterion is important to help preserve the user’s mental map 
of the three-dimensional graph drawing. One may set a tolerance radius e for 
the change in viewpoint 6{p,p'). By restricting p' to satisfy 6{p,p') < e, we 
ensure that the user’s mental map is preserved. Alternatively, if S{p,p') > e, 
then the movement can be animated such that the change of viewpoint be- 
tween two frames is always within the radius e. This introduces the notion 
of two tolerance radii: e\, the maximum change allowed over the entire ani- 
mation; and C 2 , the maximum change allowed in any step of the animation. 

The application of a viewpoint-finding algorithm occurs in two phases. In 
the first phase, a three-dimensional graph drawing is preprocessed to build a 
data structure which identifies its best viewpoints. In the second phase, this 
data structure is repeatedly queried, to find viewpoints that satisfy the above 
criteria. These two phases suggest two additional criteria. 

Preprocessing Speed - Preprocessing should be fast enough to allow access to the 
query algorithm within a “reasonable time”. Ideally, the algorithm should 
be available as soon as the drawing is loaded into the application. To achieve 
this, preprocessing can be carried out when the drawing is generated, and the 
resulting data structures loaded into the application along with the drawing. 
However, applications that generate graph drawings interactively must still 
perform preprocessing “on the fly” . 

Query Speed - The query algorithm should return a viewpoint “quickly” . Ide- 
ally, a viewpoint should be found in no more time than it takes to render the 
drawing. Rendering a graph as a simple line-drawing usually requires time 
linear in the number of graph elements |G| = \V\ + \E\. 

The algorithms for finding best viewpoints given in Sects. ^3 and ^3 return 
best viewpoints within the precision used to build the corresponding diagrams. 
This easily satisfies the quality criterion for any reasonable precision. These al- 
gorithms also satisfy the locality criterion, in the sense that they return the 
locally-best viewpoint nearest by great-circle distance to a given viewpoint. The 
query speed criterion is also satisfied - indeed over-satisfied - as these algorithms 
return a viewpoint in logarithmic time. However, these algorithms require exten- 
sive preprocessing before their query algorithms can be used. For a given graph 
G, preprocessing can require log |G|) time under the rotational separa- 

tion measure, and even more under the observed separation measure. These time 
requirements are clearly too large for these algorithms to be useful in practice. 
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In order to reduce the computational cost associated with finding a locally- 
best viewpoint, we present algorithms that relax the quality criterion to find 
“reasonably good” viewpoints, while satisfying the remaining criteria. 

4 Iterative Improvement Algorithms 

Iterative improvement m is a simple search technique used to find points in 
a given space that optimise a given function. An iterative improvement algo- 
rithm works by repeatedly selecting a trial point and evaluating the optimisa- 
tion function, retaining the point which yields the best value. The choice of the 
optimisation function, the way in which a trial point is chosen, and the way in 
which the decision is made to terminate, all influence the running time of the 
algorithm, and the quality of the solution produced. For the problem at hand, 
the optimisation function is either of the goodness measures, denoted Q{p,If). 

Various methods can be used to choose a trial viewpoint. Choosing a view- 
point p' at random from the set of all viewpoints {blind random search) does 
not satisfy the locality criterion, as the initial viewpoint has no effect on the 
final viewpoint. A better method is to choose a trial viewpoint p' from within 
the intersection of two limiting circles: one centred on the initial viewpoint, with 
radius ei; and one centred on the current viewpoint, with radius £2 (recall that 
£i and £2 are the tolerance radii that preserve the user’s mental map). 

Calculating Q{p,I') requires 0(|Gp) time. It follows that t should be in 
0(l/|Gp) to satisfy the query speed criterion. However, for large graph draw- 
ings, t may become too small to find a “reasonably good” viewpoint within this 
time. This problem can be (partially) solved by animating the algorithm. 

Algorithm 1 Animated Iterative Approximation 

Inputs.' initial viewpoint p; goodness function Q; occlusion curves <F; 
limiting radii £i,£2,' number of iterations t. 

Outputs.' /inaZ viewpoint. 

1. Limiting centre ci <— p. 

2. While p is not “reasonably good”: 

(a) Limiting centre C2 <— p. 

(b) For t iterations: 

i. Choose p' 7^ p, such that S{ci,p') < £1, and S{c2,p') < £2. 
a. It Q{p' ,L') > Q{p,L'), then p ^ p' . 

(c) Display the graph drawing from the viewpoint p. 

3. Return p. 

On each step of the animation, t trial viewpoints are assessed, requiring 
0(t|Gp) time; then the graph is displayed. Between steps, the centre of the 
inner limiting circle moves to the current viewpoint p. Alternatively, the inner 
limiting circle can be moved each time a better viewpoint is found. 

Explicitly calculating whether the current viewpoint p is “reasonably good” 
is difficult, as there is no efficient way of comparing p against the theoretically- 
exact best viewpoints of the graph drawing. Instead, we use heuristics to decide 
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Fig. 1. Regions of better viewpoints within a limiting circle. 



whether p is “reasonably good” . Highly interactive applications can continue 
searching for a better viewpoint until the user interrupts the process, placing the 
burden of determining what is “reasonably good” on the user. Other applications 
can terminate the while-loop when the current viewpoint p does not change for 
a set number of trials. We use (c, e) to denote a lesser circle, centred on c, 
with radius e. Let B be the region of viewpoints p' within (02,62) such that 
Q{p',^) > Q{p,^), let the variable R = be the proportion of the area of 

(c2, 62) taken up by B, and let T be the random variable of the number of trials 
in which p has not changed. It can be shown that 

E[T] = 4-1 and Pr[T >t\R>k]<{l-kf . 

R 

If p is within a small distance of an occlusion curve, then R > 0.5 (as illustrated in 
Figs.Qt and Ct) . It follows that the iterative approximation algorithm moves the 
current viewpoint quickly away from bad viewpoints. However, as p approaches 
a locally-best viewpoint, R approaches 0 , and the expected number of trials 
needed to improve on the current viewpoint increases dramatically. 

To avoid the situations in which R approaches 0 , when the viewpoint p has 
not changed for a set number of trials, the inner limiting radius 62 is changed in 
an attempt to increase the value of R. There are two cases to consider: 

1 . If R is relatively small and is wholly contained within (c2, 62), then decreasing 
62 makes (02,62) a tighter approximation to B (see Fig.^J;). 

2 . If the current viewpoint is on a “spike” of B, and B is a small part of a larger 
region of better viewpoints, then increasing 62 can increase the proportion 
of this region in (02,62) (see Fig. 01 ). 

Obviously, these two cases conflict, and it cannot be determined with certainty 
which case applies for any given viewpoint. Our approach is to use a dynamic 
inner limiting radius 62, initially set to 62. At each step, if p does not change, 
then 62 is decreased. Otherwise, 62 is increased, but is never set higher than 62. If 
62 reaches a lower limit eg (determined by the “equivalence” angle of the quality 
criterion), then the algorithm decides that it has achieved a “reasonably good” 
viewpoint and terminates. 
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4.1 Pruning 

The iterative improvement algorithm described so far is a practical method for 
approximating the best viewpoints of small three-dimensional graph drawings. 
However, for larger graph drawings, the 0 (|<f'|) time taken to compute the good- 
ness of each trial viewpoint becomes too great. One approach to this problem 
is to prune those occlusions curves which cannot possibly determine the good- 
ness of a trial viewpoint. Let {c,e)) = minpg(c,e) V'(o, &)) and 

^/max(V'(a, b), (c, e)) = maxpg(c,e) G{p, ip{a, b)). Initially, we prune W to yield 



iJ'i = b) 



^min (^/i(a, 6 ), (ci,ei)) < min G 

n 






The time taken to generate is 0 (|'?'|). This cost is paid at the beginning of 
each query, after which each viewpoint can be evaluated in 0 (<f'i) time. If £2 is 
significantly smaller than ei , then a second stage of pruning can further improve 
the running time of our algorithm. At the start of each animation step, we can 
prune by the inner limiting circle (c2,C2) to yield tf'2- This second pruning is 
advantageous when < 1 — ©(y), where t is the number of trials per step. 



4.2 Clipping 

While calculating ^(p, !?'), the bad viewpoint p' that minimises G{P:'l') is implic- 
itly identified. It is natural to expect that, if a better viewpoint than p exists, 
then it should lie in the direction from p opposite to that of p' . This intuition is 
supported by the observations in Fig. Q. A worthwhile heuristic then, is to re- 
strict the choice of trial viewpoints to those viewpoints in that half-disc of (c2, £2) 
that lies furthest from p' . This single- clipping heuristic potentially doubles the 
probability of generating a better viewpoint on any given trial. 

The probability of generating a better viewpoint may be further increased 
by also considering the bad viewpoint p" that results in the second (possibly 
equally) minimal value of ^(p, S'). If the goodness values implied by p' and p" 
are approximately equal, then the choice of trial viewpoint is restricted to those 
viewpoints in the intersection of the two half-discs of (c2, £2) that lie furthest from 
p' and p", respectively. This double- clipping heuristic is particularly effective at 
increasing the probability R when the current viewpoint is in a “spike” (Fig.^). 

Unfortunately, in some situations, these clipping heuristics can result in all 
viewpoints better than p being excluded from the set of trial viewpoints. To allow 
for this possibility, if no improvement in the current viewpoint occurs within a 
set number of trials, then the clipping heuristics are disabled, until such time as 
a better viewpoint is found. 



5 Force-Directed Algorithms 

Force-direction is a well-established paradigm in the automatic layout of graph 
drawings p 4 l I l)j . Force-directed algorithms model graph drawings as physical sys- 
tems - they calculate the forces applied to a vertex of the system by all other 
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vertices and edges, then moving the vertex in the direction resulting from the 
combination of these forces. Using force-directed methods to find best viewpoints 
is somewhat simpler, as this only requires movement of the current viewpoint. 

The force calculations used to find best viewpoints are based on the double- 
clipping heuristic described earlier. The force applied by a bad viewpoint on the 
current viewpoint p is directed along the great-circle arc from the bad viewpoint 
through p. Let p' and p" be two bad viewpoints from distinct occlusion curves 
that determine the two smallest value of Q{p,W), such that G{p,p') < G{p,p”)- 
If G{p,p') ~ G(p,p”), then the movement of p is directed by the average of the 
forces applied by both p' and p"; otherwise, only p' is used. In the unlikely event 
that the forces applied by p' and p" exactly cancel each other out, the unwanted 
stability is avoided by means of a small random shift in viewpoint. 

Once a direction has been decided, we must determine the distance ea < €2 by 
which to move the current viewpoint. We base our approach on that of Brufi and 
Frick P). Initially, we use a small distance £3. On all subsequent moves, we adjust 
the value of £3: increasing it if the current move is in the same direction as the 
previous move, and decreasing it if the current move is in the opposite direction. 
The small initial value of £3 is chosen to prevent the viewpoint moving out of its 
cell in the bad viewpoint arrangement. This helps to satisfy the locality criterion, 
independent of the choice of an outer limiting radius £1. If the viewpoint jumps 
outside its cell, then the algorithm can stabilise at a locally-best viewpoint far 
from the initial viewpoint (but still within £1). 

Algorithm 2 Force-Directed Approximation 

Inputs.' initial viewpoint p; goodness function G; occlusion curves 'P; 
limiting radii £i,£2,C3)ee/ number of moves t. 

Outputs.' /inaZ viewpoint. 

1 . Limiting centre ci <— p; direction d' <— 0; limiting radius £3 <— £3. 

2 . While £(i > eg: 

(a) Limiting centre C2 <— p. 

(h) For t moves: 

i. Find a bad viewpoint p' G W, that minimises G(j>,p'). 

a. Find a bad viewpoint p" p' G F, that minimises G{p,p"). 

Hi. If G{p,p’)~G{p,p"), then d^ direct{p,p’ ,p"), else d^ direct {p,p'). 

iv. p ^ p + e'^d. 

V. Clip p by (ci,£i) and (c2,£2). 

vi. If d ^ d' , then £3 ^ min(mcrea5e(e3), € 2 ). 

vii. If cZ ~ —d', then £3 ^ mm(decrease{e'^) , £3). 

via. d' ^ d. 

(c) Display the graph drawing from the viewpoint p. 

3 . Return p. 

5.1 Randomised Force-Direction 

Like the iterative improvement algorithm, the force-directed algorithm requires 
□ (•f") time to calculate each move. A first stage of pruning can be used to decrease 
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the number of occlusion curves to be considered in each move; however, a second 
stage of pruning is seldom effective, as there are often only a few moves before 
the viewpoint reaches the edge of the inner limiting circle. 

Another approach, which proves to be highly effective in reducing the time 
taken to calculate each move, is randomisation nni. When processing the initial 
viewpoint, rather than using all the occlusion curves in 'J', we use a random 
sample of occlusion curves chosen uniformly from S'. Of these, the occlusion 
curves that imply the worst w > 2 goodness values are retained in a list SV- 
After each move, a new random sample is taken from S' and used to update 

The randomised force-directed algorithm behaves identically to the determin- 
istic version as long as the two bad viewpoints most restrictive to the current 
viewpoint are on occlusion curves contained in If we assume that the cur- 
rent viewpoint is static, then the expected number of moves until both of these 
occlusion curves are discovered is at most If the current viewpoint moves, 
then the two most restrictive occlusion curves can change. In practice, even if w 
is chosen to be a constant such as 10, the two most restrictive curves are almost 
always found in tfv, due to the restriction imposed on each move by 62- 

The time required to calculate each move is 0(|S^s| -I- wlog w). If w is chosen 
to be a constant, then the expected work done to find the two most restrictive 
occlusion curves is ^[0(|S^s|) = 0(l^'l)-i after which the algorithm is expected 
to converge to a locally-best viewpoint. Ideally, to satisfy the query speed cri- 
terion, we would like |>Z's| in 0(|G|). However, the expected number of moves 
before the algorithm finds the two most restrictive bad viewpoints could then be 
as large as 0(|G|). Rather, we choose [tf'sl in Odtf'l^); the expected number of 
moves is then no larger than 0 (a/|G|). The time taken by each move is 0(|G| 2 ) 
for sparse graphs {\E\ in 0(|y|)), and 0(|G|t) for dense graphs {\E\ in 0(|Rp)). 

6 Experimental Results 

In this section, we present some experimental results on the performance of our 
approximation algorithms. All results were obtained using a set of randomly 
generated three-dimensional graph drawings. Vertices were chosen uniformly at 
random within the unit sphere, and edges were then chosen uniformly at ran- 
dom from the set of all possible edges on these vertices. The algorithms were run 
several hundred times on each drawing, using randomly generated initial view- 
points, and the averages taken over these runs. In this paper, we limit ourselves 
to the rotational separation measure; initial results for the observed separation 
measure are similar. 

Figure 0 shows the time taken for the iterative approximation algorithm to 
terminate, plotted against |G|, and the number of trials in each animation step. 
These values were generated using ei = §, £2 = 55, and terminating when 
the viewpoint remained static for 54 trials; this number was chosen as it is the 
number of trials needed to reduce | to arcsin( when e '2 is reduced by 10% on 
each unsuccessful trial. The time taken for the algorithm to terminate increases 
rapidly as the number of trials per step increases from 1. This increase peaks at 5, 



220 



Michael E. Houle and Richard Webber 




Fig. 2. Varying the trials per 
step. 



Fig. 3. Varying convergence 
rate. 



beyond which the time taken appears to be relatively independent of the number 
of trials per step. It follows that (if pruning is not used) the inner limiting circle 
should be updated as soon as a better viewpoint is found. 

Figure 0 shows the effects of altering the inner limiting radius on the time 
taken for the iterative approximation algorithm to terminate. These results were 
generated using ei = 62 = and terminating when < arcsin(j(^). The 
inner limiting radius was multiplied by the convergence parameter k on each 
unsuccessful trial, and multiplied by ^ on each successful trial. The parameter k 
was varied between 0.01 and 0.99. The time taken to terminate increases slightly 
super-linearly in terms of |G|, but exponentially in terms of k. Based on this 
figure, one might be tempted to choose a very low value for k. Unfortunately, as 
k decreases, the chance of the algorithm terminating before a “reasonably good” 
viewpoint is reached increases. 

Figure^ shows the effectiveness of pruning, measured by the ratio plot- 
ted against ei and |G|. These values were generated by pruning with a limiting 
circles of radius ei, centred on viewpoints chosen uniformly at random from the 
unit sphere. The value of ei was varied between 0.1 and 1.5. As expected, the 
effectiveness of pruning increases (the ratio decreases) as ei decreases. For 
small graph drawings, the effectiveness of pruning varies significantly, reflecting 
its dependence on the exact configuration of the occlusion curves. For larger 
drawings, the effectiveness of pruning appears to be independent of |G|. 

Figure Eb shows the effectiveness of pruning, measured by the time taken for 
the iterative approximation algorithm to terminate when two stages of pruning 
are used. These results were generated using ei = f , 62 = and k = 0.9. 
The number of trials per step was varied between 1 and 10. As the number of 
trials per step increases, there is a corresponding decrease in the time taken for 
the algorithm to terminate, relative to the size of the graph. To understand this 
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Fig. 4. The effectiveness of pruning. 



behavior, recall that the work done in each animation step is dominated by the 
second stage of pruning, which is amortised over the number of trials. It follows 
that the effective cost of pruning can be reduced, by increasing the number of 
trials per step. However, for small drawings, the number of trials per step needed 
to make pruning worthwhile can result in many of these trials being wasted. 

Finally, Figs. Et and Eb show the time taken for the force-directed algo- 
rithm to terminate, using the deterministic and randomised approaches, respec- 
tively. These results were generated using ei = f , C 2 = gg, £3 = ggg, and 
terminating when < arcsin The distance functions were set such that 

decrease(e 3 ) = and increase(e 3 ) = p. The convergence parameter k' was 

varied between 0.1 and 0.99. The sample sizes were set to |<Fs| = , and 

w = 10. For large values of k', the time taken for these algorithms to termi- 
nate exhibits a slightly super-linear dependence on |G|. When |G| is fixed, as k' 
decreases from 1, the time taken decreases exponentially. This behaviour is anal- 
ogous to that exhibited by iterative approximation in Fig. El However, for small 
graph drawings with k' < 0.5, the performance of these algorithms degrades and 
becomes erratic - this is especially true of the randomised algorithm. This occurs 
because the low value of k' causes the viewpoint to jump from cell to cell of the 
bad viewpoint arrangement. For small drawings, this behaviour can persist for 
many moves. For larger drawings, the cells of the bad viewpoint arrangement 
are usually too small for this behaviour to impact on the average performance. 
The results indicate that, as long as ^ is set to some small proportion of |G|, 
the force-directed algorithms provide a fast and reliable means of finding best 
viewpoints. 

To summarise, both the iterative improvement and force-directed approaches 
result in useful algorithms for finding “reasonably good” viewpoints for three- 
dimensional graph drawings. Both approaches benefit from the use of a dynamic 
inner limiting radius, as long as the convergence parameters are set sufficiently 
close to 1 to ensure a “reasonably good” viewpoint is reached. When applied to 
large graph drawings, the iterative improvement approach can benefit from the 
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Fig. 5. The performance of force-directed approximation. 



use of pruning. Similarly, the force-directed algorithms can benefit from the use 
of randomisation. Indeed, the randomised force-directed algorithm out-performs 
the other algorithms presented in this paper by a significant margin. 

We conclude with several examples of three-dimensional graph drawings, 
viewed from their best viewpoints. 
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Fig. 6. Some example graph drawings viewed from their best viewpoints. 



7. P. D. Eades, M. E. Houle, R. Webber (1997): “Finding the Best Viewpoints 
for Three-Dimensional Graph Drawings” in Proc. 5th Int. Symp. Graph Drawing 
(Rome); Springer- Verlag, LNCS', 1353:87-98 

8. H. Edelsbrunner (1987): Algorithms in Combinatorial Geometry, Springer- Verlag 

9. J. D. Foley, A. van Dam, S. Feiner, J. Hughes (1990): Computer Graphics: 
Principles and Practice, 2nd ed.; Addison- Wesley 

10. T. M. J. Fruchterman, E. M. Reingold (1991): “Graph Drawing by Eorce- 
Directed Placement” in Software - Practice and Experience] 21(11):1129-1164 

11. T. Kamada, S. Kawai (1988): “A Simple Method for Computing General Posi- 
tion in Displaying Three-Dimensional Objects” in Computer Vision, Graphics and 
Image Processing-, 41(l):43-56 

12. K. Misue, P. D. Fades, W. Lai, K. Sugiyama (1995): “Layout Adjustment and 
the Mental Map” in J. Visual Languages and Computing-, 6:183-210 

13. R. Motwani, P. Raghavan (1995): Randomized Algorithms-, Cambridge Univer- 
sity Press 

14. P. H. J. M. Often, L. P. P. P. van Ginneken (1989): The Annealing Algorithm-, 
Kluwer Academic 

15. H. Plantinga, C. R. Dyer (1990): “Visibility, Occlusion, and the Aspect Graph” 
in Int. J. Computer Vision; 5 (2): 137-160 

16. C. Ware, G. Franck (1996): “Evaluating Stereo and Motion Cues for Visualizing 
Information Nets in Three Dimensions” in ACM Trans. Graphics; 15(2):121-140 

17. D. R. Wood (1998): “Two-Bend Three-Dimensional Orthogonal Grid Drawing of 
Maximum Degree Five Graphs”; in this proceedings 



Level Planarity Testing in Linear Time* 



Michael Jiinger^, Sebastian Leipert^, and Petra Mutzel^ 

^ Institut fiir Informatik, Universitat zu Koln, 50969 Koln, Germany, 
m j uengerSinf ormat ik . uni-koeln . de 
^ Institut fiir Informatik, Universitat zu Koln, 50969 Koln, Germany, 
leipertOinf ormat ik . uni-koeln . de 
® Max-PIanck-Institut fiir Informatik, 66123 Saarbriicken, Germany, 
mutzel@mpi-sb.mpg . de 



Abstract. In a leveled directed acyclic graph G = {V, E) the vertex set 
V is partitioned into A: < |U| levels Vi, V 2 , ■ • ■ , Vfc such that for each edge 
(u,v) € E with u £ Vi and v £ Vj we have i < j. The level planarity 
testing problem is to decide if G can be drawn in the plane such that for 
each level Vi, all v £Vi are drawn on the line h = {(r, k — i) \ x £ R}, the 
edges are drawn monotone with respect to the vertical direction, and no 
edges intersect except at their end vertices. If G has a single source, the 
test can be performed in 0(|U|) time by an algorithm of Di Battista and 
Nardelli (119331) that uses the PQ-tree data structure introduced by Booth 
and Lueker (|i9Y^. PQ-trees have also been proposed by Heath and 
Pemmaraju (1996a,b) to test level planarity of leveled directed acyclic 
graphs with several sources and sinks. It has been shown in Jiinger, 
Leipert, and Mutzel (I199YI) that this algorithm is not correct in the 
sense that it does not state correctly level planarity of every level planar 
graph. In this paper, we present a correct linear time level planarity 
testing algorithm that is based on two main new techniques that replace 
the incorrect crucial parts of the algorithm of Heath and Pemmaraju 
(1996a,b). 

1 Introduction 

A fundamental issue in Automatic Graph Drawing is to display hierarchical 
network structures as they appear in software engineering, project management 
and database design. The network is transformed into a directed acyclic graph 
that has to be drawn with edges that are strictly monotone with respect to the 
vertical direction. Most applications imply a partition of the vertices into levels 
that have to be visualized by placing the vertices belonging to the same level on 
a horizontal line. These graphs are called leveled graphs. Testing whether such 
a graph is level planar, i.e. can be drawn without edge crossings, was solved by 
Di Battista and Nardelli (1198811 for leveled graphs with a single source using the 
PQ-tree data structure. 

* Supported by DFG-Grant Ju204/7-2, Forschungsschwerpunkt “Efliziente Algorith- 
men fiir diskrete Probleme und ihre Anwendungen” 
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PQ-trees have also been proposed by Heath and Pemmaraju ( 1996a, b) to test 
level planarity of leveled directed acyclic graphs with several sources and sinks. 
It has been shown in Jiinger, Leipert, and Mutzel that this algorithm is 

not correct in the sense that it does not state correctly level planarity of every 
level planar graph. In this paper, we present a correct linear time level planarity 
testing algorithm that is based on two main new techniques that replace the 
incorrect crucial parts of the algorithm of Heath and Pemmaraju ( 1996a, b). 

This paper is organized as follows. In the next section we give a short intro- 
duction to the PQ-tree data structure and the level planarity test presented by 
Heath and Pemmaraju (1996a,b) and we summarize the incorrect crucial parts 
of this algorithm. In the third section we present a correct algorithm and show 
how to obtain linear running time. In the last section we make some remarks on 
the construction of a level planar embedding based on our algorithm. 

2 Preliminaries 

Let G = (V,E) be a directed acyclic graph (dag). A leveling of G is a function 
lev : P — > Z mapping the nodes of G to integers such that lev(y) > lev{u) for all 
(m, v) G E. a leveling of G is called proper if lev{v) = lev{u) + l for all {u, v) G E. 
G is called a leveled dag if a leveling has been assigned to it. If lev(v) = j, then 
u is a level- j vertex. Let Vj = lev~^{j) denote the set of level- j vertices. Each Vj 
is a level of G. 

For the rest of this paper, we assume w.l.o.g. that G is a proper leveled dag 
with k G N levels. We will show in the last section, that our algorithm can be 
adapted to non proper leveled dags without any modification, preserving the 
linear running time. However, the algorithm is easier to understand for proper 
hierarchies. 

An embedding of G in the plane is called leveled if the vertices of every Vj , 
1 ^ J ^ fc) are placed on a horizontal line Ij = {{x,k — j) | a; G ffi}, and every 
edge (u,v) G E, u G Vj, V G Vj+i is drawn as a straight line segment between the 
lines Ij and Ij+i- A leveled embedding of G is called level planar if no two edges 
cross except at common endpoints. A leveled dag is level planar, if it has a level 
planar embedding. The dag G is obviously level planar, if all its components are 
level planar. We therefore assume that G is connected. 

A leveled embedding of G determines for every Vj, 1 < j < k, a, total order 
<j of the vertices of Vj, given by the left to right order of the nodes on Ij. In 
order to test whether a leveled embedding of G is level planar, it is sufficient to 
find an ordering of the vertices of every set Vj, 1 < j < k, such that for every 
pair of edges (ui, ui), (u 2 , U 2 ) G E with lev{u\) = lev{u 2 ) = j and u\ <j U 2 it 
follows that vi <j+i V 2 . Apparently, the ordering <j, 1 < j < k, describes a 
permutation of the vertices of Vj . Let Gj denote the subgraph of G, induced by 
Vj U Vj U . . . U V, . Unlike G, Gj is not necessarily connected. 

The basic idea of the level planarity testing algorithm presented by Heath 
and Pemmaraju (I996a,b) is to perform a top-down sweep, processing the levels 
in the order Vj, Vj, . . . , Vj and computing for every level Vj, 1 < j < fc, a set of 
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permutations of the vertices of Vj that appear in some level planar embedding of 
Gj . In case that the set of permutations of Gk is not empty, the graph G = Gk 
is obviously level planar. 

A PQ-tree is a data structure that represents the permutations of a finite 
set U in which the members of specified subsets occur consecutively. This data 
structure has been introduced by Booth and Lueker dTHTnil to solve the problem 
of testing for the consecutive ones property. A PQ-tree contains three types of 
nodes: leaves, P-nodes, and Q-nodes. The leaves are in one to one correspondence 
with the elements of U. The P- and Q-nodes are internal nodes. A P-node is 
allowed to permute its children arbitrarily, while the order of the children of a 
Q-node is fixed and only may be reversed. In subsequent figures, P-nodes are 
drawn as circles while Q-nodes are drawn as rectangles. 

The set of leaves of a PQ-tree T read from left to right is denoted by 
frontier(T) and yields a permutation on the elements of the set U . The frontier 
of a node X, denoted by frontier (A), is the sequence of its descendant leaves. 
Given a PQ-tree T over the set U and given a subset S C U, Booth and Lueker 
ITTJTn] developed a pattern matching algorithm called reduction and denoted by 
REDUCE(T, S) that computes a PQ-tree T' representing all permutations of T 
in which the elements of S form a consecutive sequence. 

If Gj is a hierarchy, the set of permutations of the vertices of Vj that appear in 
some level planar embedding of Gj can be represented by a PQ-tree Tj according 
to Di Battista and Nardelli ll I as follows: 

1. identify with every vertex of Vj exactly one leaf, 

2. identify with every cut vertex in Gj a P-node, 

3. identify with every maximal biconnected components in Gj a Q-node, 

In order to test whether the hierarchy Gj+i is level planar, Di Battista and 
Nardelli iliitssll add for every edge {vi,w), w G I^+i, Vi G Vj, i = 1,2, . . . , fi, 
/r > 1 a virtual vertex labeled w and virtual edges to the graph 

Gj. The authors then try to compute for every vertex w G Vj+i a sequence 
of permutations of components around cutvertices and swappings of maximal 
biconnected components such that all virtual vertices labeled w form a consec- 
utive sequence on the horizontal line Ij+i- If such a sequence can be found, it is 
obvious that the vertex w can be added to Gj without destroying level planarity. 
The process of computing the prescribed sequence can be efficiently done using 
the PQ-tree Tj, yielding a linear time algorithm. 

In case that Gj, 1 < j < k, consists of more than one connected compo- 
nent, Heath and Pemmaraju suggest to use a PQ-tree for every component and 
formulate a set of rules of how to merge components Pi and P 2 , respectively 
their corresponding PQ-trees Pi and P 2 , if Pi and P 2 both are adjacent to some 
vertex v G Vj+i . 

Heath and Pemmaraju (1996a,b) reduce during a First Merge Phase the 
leaves of Pi and P 2 corresponding to the vertex v, called the pertinent leaves. 
After successfully performing the reduction, the consecutive sequence of perti- 
nent leaves is replaced by a single pertinent representative in both Pi and P 2 . 
Going up one of the trees Pi, i G {1,2}, from its pertinent representative, an 
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appropriate position is searched, allowing the tree Tj, j ^ i, to be placed into Ti. 
After successfully performing this step the resulting tree T' has two pertinent 
leaves corresponding to the vertex v, which again are reduced and replaced by a 
single representative. If any of the steps fails, Heath and Pemmaraju state that 
the graph G is not level planar. 

Merging two PQ-trees T\ and T 2 corresponds to merging the two components 
F\ and F 2 and is accomplished using certain information that is stored at the 
nodes of the PQ-trees. For any subset S of the set of vertices in L^-, 1 < j < m, 
that belongs to a component P, define ML (S') to be the greatest d < j such 
that Vd, Vd+u ■ ■ ■ ) hj induces a dag in which all nodes of S occur in the same 
connected component. For a Q'^ode q in the corresponding PQ-tree Tp with 
ordered children ri,r 2 ,... ,rt integers denoted by ML(ri,ri+i), 1 < i < t, are 
maintained satisfying ML(ri,ri+i) = ML(frontier(ri) U frontier(ri+i)). For a P- 
node p a single integer denoted by ML(p) that satisfies ML(p) = ML (frontier (p)) 
is maintained. Furthermore, define LL(P) to be the smallest d such that F 
contains a vertex in Vd and maintain this integer at the root of the corresponding 
PQ-tree. The height of a component F in the subgraph Gj is j — LL(P). Using 
these LL- and ML- values. Heath and Pemmaraju (1996a,b) describe a set of rules 
how to connect two PQ-trees claiming that the pertinent leaves of the new tree 
T' are reducible if and only if the corresponding component F' is level planar. 

In Jiinger, Leipert, and Mutzel (ITW7I) we have shown that the order of merg- 
ing the components is important for testing a leveled dag. Moreover, it is easy 
to see that using different orderings while merging three or more components 
results in different PQ-trees. So even if every order of merging PQ-trees with 
pertinent leaves labeled v, lev{v) = J, 1 < J < k, results in a reducible PQ-tree, 
a PQ-tree may be constructed such that the leaves of some vertex /, lev{l) > j 
are not reducible, although the graph G is level planar. Hence the algorithm 
presented by Heath and Pemmaraju (1996a,b) may state incorrectly the non 
level planarity of a level planar graph. 

Furthermore, components of Gj, that have just one level-j vertex are not 
treated properly. In fact, they may be inserted at wrong positions in other PQ- 
trees. This is due to the fact that during the first merge phase the algorithm 
reduces for every PQ-tree all leaves with the same label and replaces them by a 
single representative. Clearly, this replacement corresponds to the construction 
of new interior faces in the corresponding subgraph. However, PQ-trees are not 
designed to carry information about interior faces, hence the information about 
the “space” within these interior faces gets lost. It is easy to see that situations 
may occur where components being adjacent to just one level-j vertex have to 
be embedded within one of these interior faces. The approach of Heath and 
Pemmaraju (I996a,b) does not detect this fact, which is another reason that it 
may incorrectly state the non level planarity of a level planar graph. 

Heath and Pemmaraju (1996a,b) claim that their algorithm can be imple- 
mented using only 0(|U|) time. This is true for the merge and reduce operations. 
However, considering two PQ-trees Pi, T 2 both having a leaf labeled v and a leaf 
labeled w, Heath and Pemmaraju (1996b) suggest to merge the trees Ti and T 2 
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at the leaves labeled v constructing a new PQ-tree T and then reduce T with 
respect to the leaves labeled v as well as with respect to the leaves labeled w. 
It is not clear how the update operations that are necessary for detecting both 
pairs of leaves can be done in 0(|y|) time, Heath and Pemmaraju (1996a, b) do 
not discuss this matter. 

We will combine two new strategies to eliminate the problems we encountered 
in the algorithm of Heath and Pemmaraju (1996a,b). 

3 A Correct Linear Time Level Planarity Test 

In this section we discuss how to construct a correct algorithm LEVEL-PLANAR- 
TEST that tests a leveled dag G = (Vi, V 2 , ■ • ■ , 14; A) for level planarity. Since 
Gj is not necessarily connected, let rrij denote the number of components of Gj 
and let Ff , i = 1,2,... , mj, denote the components of Gj. Number the vertices 
of level j -I- 1 arbitrarily from 1 to |V,+i | . We refer to the vertices of Vj+i by their 
numbers. Let Flf be the component formed by adding to A/ all edges with one 
end in and the other end in Vj+i, keeping the ends in l^-i-i separate. These 
edges are called virtual edges and their ends in Vj+i are called virtual vertices. 
The virtual vertices are labeled as their counterparts in Vj+i, but they are kept 
separate. Thus there may be several virtual vertices with the same label, adja- 
cent to different components of Gj and each with exactly one entering edge. The 
component F[^ is called the extended form of Ff and the set of virtual vertices of 
Flf is called frontier(iJ/). Let Bl be a level planar embedding of FIf . Obviously, 
all virtual vertices of Hf are placed on the same horizontal line on the outer face. 
The set of virtual vertices of that are labeled v G Vj+i is denoted by Sf . 
Figure n shows an example of an extended form Fl^ and its corresponding PQ- 
tree, representing all permutations of the virtual vertices that appear in some 
level planar embedding of . The form has two virtual vertices labeled v. 




Fig. 1. An extended form and its PQ-tree. 



The component that is created from an extended form Fl^ by identifying 
for various v G Vjj-i all virtual vertices with the label u to a single vertex Vi 
with label v is called reduced extended form and denoted by . The form is 
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called proper if for all v S the virtual vertices with the same label v have 
been identified, otherwise R\ is called sloppy. The set of virtual vertices of Rj is 
denoted by frontier(ii^). If the virtual vertices labeled v have been identified in 
Rj to a vertex Vi, we denote by = {vi} the set of vertices with label v of Rj. 
Figure 0 shows an example of a reduced extended form R^ and its corresponding 
PQ-tree. The form Rf has been constructed from the extended form Hf shown 
in Fig.Q by identifying the two virtual vertices labeled v. The corresponding 
PQ-tvee has been constructed by reducing the two leaves labeled v applying the 
pattern matching algorithm of Booth and Lueker fll 97(11 . 




Fig. 2. A proper reduced extended form R\ and its PQ-tiee. 



Identifying the sets yf 0 and Sf yf 0 of two reduced extended forms R^ and 
Rj, i I, i,l G {1,2, . . . , rrij}, to a single vertex with label v is denoted by 
R{ LlyR^. We call R{ UyRj a merged reduced component. If LL(iZ^) < LL(i?^) we 
say Rj is v-merged into Rj . The component that is created by u-merging Rj into 
Rj is again a reduced extended component and denoted by Rj (thus renaming 
Rj Rj with the name of the “higher” component). If Rj, i G {1,2,... ,mj} is 
a reduced extended component, such that 5'“ y^ 0 for some v G and S')" = 0 
for all w G Vj+i — {u}, then Rj is called v-singular. 

A collection C{Gj), I < j < k, denotes the set of level planar embeddings of 
all components of Gj. One of our results is that in case that Gj is level planar, 
a PQ-tree T{F^) can be associated with every P/ of Gj describing the set of 
level planar embeddings of P/. As has been shown in Booth and Lueker lIlPYhIl . 
it is straightforward to construct from T(P/) a PQ-tree T{Hj) associated with 
Hf. Thus the leaves of T{Hj) correspond to the virtual vertices of Hj and we 
label the leaves of T{Hj) as their counterparts in Hf . By construction, G{Gj) 
is a set of PQ-trees. Considering a function CHECK-LEVEL that computes for 
every level j, j = 2, 3, . . . ,k the set G{Gj) of level planar embeddings of the 
components Gj, the algorithm LEVEL-PLANAR-TEST can be formulated as 
follows. 

Bool LEVEL-PLANAR-TEST (G = (W, V 2 , ■ ■ • , Vfc; P)) 
begin 

Initialize G(Gi); 
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for j := 1 to fc — 1 do 

C{Gj+i) = CHECK-LEVEL(C(Gj),yj+i); 
if C{Gj+i) = 0 then 

return “G is not level planar.”; 
return “G is level planar.”; 
end. 



We introduce two new strategies that lead to a correct algorithm as well as 
new techniques for obtaining linear running time. One strategy is to sort all 
PQ-trees with a leaf labeled v in their frontier according to their LL-values 
and merge them according to this ordering. We show that the new PQ-tree 
constructed by the application of this ordering represents all possible level planar 
embeddings of the corresponding new component. Our second strategy for a 
correct treatment of i;-singular components consists of keeping at every single 
representative the size of the largest interior face that has been constructed by 
identifying the corresponding virtual vertices. When merging a PQ-tvee of a v- 
singular component into another PQ-tree with lower LL-value, this information 
is checked first. When merging two non singular components, this information 
has to be updated when introducing a new single representative. Here we have to 
take in account that merging two components results into something that we call 
a cavity. Considering the intersection C of the halfspace {x S \ X 2 '>k — j — 1} 
and the outer face of the current embedding, a w-cavity is defined to be a region 
of C such that v is adjacent to the region. Obviously v can be adjacent to several 
such regions. Moreover, these regions are not unique, since they depend on the 
current embedding. This is no drawback, since we only need to maintain the size 
of the largest x-cavity which can be easily implemented using the PQ-tvees, and 
the LL- and ML- values of Heath and Pemmaraju (1996a,b). Figure 0shows such 
a u-cavity. The arrow on the right side of the figure depicts the height of the 
cavity. A u-singular component can only be level planar embedded within this 
cavity, if it is smaller than the height of the cavity. 




height 

ri-cavity 



Fig. 3. A x-cavity. 



Level Planarity Testing in Linear Time 



231 



As we have mentioned in the previous section, merging two PQ-trees at 
leaves labeled v may result in a PQ-tree T with several leaves labeled w ^ v. 
Linearity of the algorithm is achieved by not applying the strategy of reducing 
the leaves labeled w, since it is not clear if the detection of these leaves reveals 
linear running time. We reduce these leaves labeled w only when considering their 
PQ-tree T for a merge operation at w. Thus we first merge all leaves labeled 
w in every tree and then merge these trees at w. We show that the modified 
algorithm works correctly. When merging PQ-trees, update operations have to 
be applied to the leaves of the new tree, since the leaves must know the PQ- 
tree that they belong to. To avoid the usage of Fast-Union-Find-Set operations 
which sum up to 0(|y|a(|y|, |y|)) operations, we apply the following strategy. 
Leaves are updated only when they are involved in a reduce or merge operation. 
In order to update the leaves, we traverse all nodes from the considered leaf to 
its root. Let U he a set of PQ-trees with leaves labeled w in their frontier. We 
show that if this strategy is applied for all leaves except for the leaves in the 
PQ-tree with the lowest LL- value in U, the number of operations is proportional 
to the number of operations needed to reduce all these leaves. We do not need to 
know the PQ-tree with the lowest LL-value in U. It is easy to see that this tree 
is implicitly defined. Hence we can avoid for every merge operation the traversal 
of the tree corresponding to the highest component. Thus the total number of 
operations needed to perform the updates is bounded by 0(|y|). 




Fig. 4. The forms Rj, i = 1,2,... ,5 are merged at vertex v. The u-singular 
components are drawn shaded and placed into an interior face of R\. The non 
singular components are merged according to their height. The vertices Vi, i = 
1,2,... ,5 and U{i, 2 } each correspond to a new leaf that replaces the sequence 
of pertinent leaves in the corresponding PQ-tree. 



The procedure CHECK-LEVEL is divided into two phases. The First Re- 
duction Phase constructs the PQ-trees corresponding to the reduced extended 
forms of Gj. Every PQ-tree T{F^) that represents all level planar embeddings 
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of some component is transformed into a PQ-tree T(Hi’) representing all 
level planar embeddings of the extended form Hj . We continue to reduce in ev- 
ery PQ-tree T{Hil) all leaves with the same label, thereby constructing a new 
PQ-tree, representing all level planar embeddings of H^, where leaves with the 
same label occupy consecutive positions. If one of the reductions fails, G cannot 
be level planar. Leaves with the same label v are replaced by a single represen- 
tative Vi- Such a single representative Vi gets the same label u, storing either a 
value PML(rii) = ML(P) if the root R of the pertinent subtree is a P-node or 
a value QML(ui) = min{ML(a;, y) | x,y consecutive children of R, x pertinent 
or y pertinent}, if the root P is a Q-node. The default value of QML(r>i) and 
PML(ui) is set to fc-l- 1. These values store the height of the largest new interior 
face that is constructed by merging the vertices labeled v and are needed to 
handle singular components correctly. 

PQ-trees of different components are merged in the Second Reduction Phase 
using a function INSERT, if the components are adjacent to the same vertex v on 
level j-l-1. Given the set of leaves labeled v, we first determine their corresponding 
PQ-trees. If some leaves labeled v are in the frontier of the same PQ-tree, we 
reduce them and replace them by a single representative. The PQ-trees are then 
merged pairwise in the order of their sizes. We show that using this ordering a 
PQ-tree T{F) is constructed, that represents all possible level planar embeddings 
of the merged components. If there is more than one u-singular reduced extended 
form, V G Vj+i, we only need to merge the largest one of these forms. If it is 
possible to embed this form level planar, all other u-singular forms obviously can 
be embedded level planar as well. Even though v may not be the only common 
vertex in the merged components, we do not reduce leaves with label w ^ v 
in the PQ-tree in order to obtain a linear time algorithm. If one of the reduce 
or merge operations fails while applied in this phase, the graph G is not level 
planar. Figure 01 illustrates the merge process. The PML- and QML-values are 
updated by using a function UPDATE. Finally we add for every source of Vj+i 
its corresponding PQ-tree. Thus the set of PQ-trees constructed by the function 
CHECK-LEVEL represents all level planar embeddings of the components Gj+i. 
The following code fragment contains operations that perform on the graph G. 
They are kept in the code for documentation purposes. Any implementation 
would of course rely only on the manipulation of the PQ-trees. 



C{Gj+i) CHECK-LEVEL(C(Gj), y,-+i) 
begin 

First Reduction Phase 

for every component P/ in Gj and its corresponding PQ-tree in T{F^) do 
construct Hf-, construct T{P[f); 
for every v G Vj+\ do 

for every extended form Flf do 
if S'} yf 0 then 

if REDUCE(P(P/), S}) = 0 then return 0; 
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else 

replace S'" in T{Hf) by a single representative vf, 

set PML(wj) or QML(tii); S" := {t>i}; 
for every extended form Hf do T{Rl) := T{Hf)] 

Second Reduction Phase 
for ti := 1 to \Vj+i \ do 

for every leaf labeled v do find the corresponding PQ-tree; 
for every found PQ-tree Tj, z G {1, 2, . . . , mj} do 
if S" > 2 then 

if REDUCE(T„S") = 0 then return 0 ; 
else 

let vt be a new single representative of S" ; 

UPDATE(S",z;t); replace S" in Ti by vt', S" := {wr}; 
let Rj, i := 1,2, . . . ,p, he the sloppy reduced extended forms; 
let o be the number of u-singular reduced extended forms; 
eliminate all u-singular R^ except for the one with the lowest LL- value; 
renumber the remaining R^ from 1 top — o-|-l;p:=p— o-|-l; 
sort the R{, such that LL(P{) < LL(P^) < LL(P^) < . . . < LL(P^); 

F := R{; T{F) := T(P{); 
for z := 1 to p — 1 do 

T(P) := INSERT(T(P),T(P^+i),z;); P := P U„ 
if REDUCE(T(P), S|,) = 0 then return 0 ; 
else 

let vp he & new single representative of S"^-, 

UPDATE(S];,,z;f); replace S}, in T(P) by vp', S}, := {zzf}; 
update the root pointers of the leaves; add all sources of Vj+i in G to H; 
add for every source a corresponding PQ-tree to C(Gj); 

C(G,+i) := G(G,); 
return G{Gj+i); 
end. 

We now describe in detail how to merge the PQ-trees corresponding to two 
components. All five rules presented by Heath and Pemmaraju can be adapted, 
but contrary to their algorithm, we have to deal with the fact that a PQ-tree 
may correspond to a singular component. Merging two PQ-trees is handled by 
the method INSERT. Let LL{Tiarge) and hL{Tsmaii) be two PQ-trees such that 
Sfarge ^ ® and yf 0 , and hhiTiarge) < hh{Tsmaii)- Assume further that 

^large ^smaii have been reduced and replaced by a single representative 
Vlarge reSp. Vsmall and that = {viarge} and = {Vsmall}- INSERT 

returns a new PQ-tree Tmerge- The method does not reduce the pertinent se- 
quence, nor does it replace pertinent leaves by a single leaf. Observe that in 
case frontier(Tsma/0 = <S'sma/;> do not really add Tsmaii to Tiarge, since the 
component corresponding to Tamaii can be embedded in an interior face or a 
u-cavity of the component corresponding to Tiarge- 
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Truer ge lNSERT{Tiarge ; small : 'll) 
begin 

if frontier(Ts„a/i) ^ S'^maii then 

attach Tsmaii to Tiarge as described in Heath and Pemmaraju (1996a, b); 
else if PML(z);arge) ^ k + 1 then 

if VML{viarge) < LL(Tsma/i) then do nothing; 
else attach Tsmaii to Tiarge as described in Heath et al. (1996a, b); 
else if QML(ti;arge) ^ k + 1 then 

if QML('i;/arge) < LL(Tsj„a/;) then do nothing; 
else attach Tsmaii to Tiarge as described in Heath et al. (1996a,b); 
return the new PQ-tree Tmerge', 
end. 



The method UPDATE is applied after two PQ“trees have been successfully 
merged and reduced at their leaves labeled v. UPDATE computes the PML- and 
QML- value of the new single representative. The values PMLr and QML,,. that 
appear in the code, are used to compute the height of the largest new cavity. 



UPDATE(S'}J,,i;f) 

begin 

PMLmin := min{PML(l;) | v G Sp}', := min{QML(tj) | v G Sp}] 

let r be the root of the pertinent subtree; 
if r is a P-node then PMLj. := ML(r); 
else if r is a Q-node then 

X, y consecutive children of r, 

X pertinent or y pertinent 

if min{PMLrnin, PMLr} < min{QML^j„, QML^} then 
PML(i;f) := min{PML„ii„, PMEJ; QML(uf) :=k + l; 
else 

QML(z;f) := min{QML,„i„, QML J; PML(uf) :=k + l; 

end. 



QML^ := min < ML(a;, y) 



The following theorem shows the correctness of our level planarity test. 

Theorem 1. Let G = {V,E) be a directed level planar graph with k >2 levels. 
The algorithm LEVEL-PLANAR-TEST tests G for level planarity. 

Proof. We use an inductive argument. Clearly, every component F of G that is 
induced by a source and its neighbors is a hierarchy. According to Di Battista and 
Nardelli II I we have a PQ-tree for every component P representing all level 
planar embeddings. So we need to show that in every iteration, the PQ-trees 
are correctly maintained and the set of permissible permutations of a PQ-tree 
always represents the set of level planar embeddings of the corresponding form. 

Let P/, i G {1,2,... ,mj}, be an arbitrary component of Gj, f < j < k, 
Hf be its extended form and Rf be its proper reduced extended from. It is 
straightforward to show that if Ti is a PQ-tree representing all level-planar 
embeddings of , then there exists a PQ-tree P/, equivalent to Ti, such that 
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for all n G {1,2,... , |P,+i |}, the leaves corresponding to occupy consecutive 
positions. Thus the PQ-tree constructed from by reducing every set S'" and 
replacing it by a single representative Vi represents all level planar embeddings 
of Rl implying the correctness of the first merge phase. 

Now let V be an arbitrary vertex of level j+1, where j < k. Let P{ , P 2 ; ■ • ■ 7 
p > 2 be reduced extended components with their virtual vertices on level 
j + 1 kept separate such that S" yf 0 for all i = 1,2,... ,p and |S™| < 1 
for all w = 1,2, ... and i = 1,2, ... ,p. Let F be the component induced 
by P{ , P 2 ) • • • ) -Rp with V being the only identified vertex. All other vertices 
with common label are kept separate. Assume w.l.o.g. that LL(P{) < LL(P 2 ) ^ 
LL(P 3 ) < . . . < LL(pl). Assume further, that F is constructed by first merging 
R{ and R 2 to 2 } identifying the sets of virtual vertices S{ and S 2 to one 
vertex v, and then merging for every z = 3,4, . . . ,p the reduced extended forms 
P|i 2 and Rj to ^ .j identifying the sets of virtual vertices S'" to 

V. Let Ti,T 2,... ,Tp be the PQ-trees corresponding to R\,R2,... , R^p. It can 
be shown by induction on the number of components that the PQ-tree T(F) 
constructed by using the function INSERT on all Tj, in the order 1,2,... ,p, 
reducing the leaves labeled v after every merge step and replacing them with a 
single representative represents all level planar embeddings of F. When proving 
the induction we differentiate between u-singular components and nonsingular 
components. In the latter case let Sz, 1 < z < p, be the set of virtual vertices of 
Rj except for vertex v, and let S{i 2 ,,,, ,i}> 1 < * < P, be the set of virtual vertices 
of 2 i} except for vertex v. The correctness of the merge operation can 
then be shown by proving first that if 7 T{i 2 ,... y}, z < p, represents a level planar 
embedding of 2 i}> then the vertices of Si form a consecutive sequence in 

and the vertex v is incident to Si. Notice that this result is not true, 
if the order of the merge process is changed. This proves that the operations for 
merging PQ-trees are correct. 

For completing the proof of correctness, let v G Vj+i, j < k, and let 
be a level planar reduced extended form with Sf yf 0 and |S'{"| < 1 for all 
w = 1,2, . . . ,v — 1 such that R^ has been constructed by zc-merging several 
reduced extended forms. Let R be the corresponding PQ-tree, representing all 
level planar embeddings of R^ . Let F be the component constructed from Rj 
by identifying all virtual vertices labeled z; to a single vertex v. It can be shown 
that the PQ-tree T{F) constructed from R by reducing 5" in Tj, replacing the 
sequence of leaves corresponding to S'" by a single representative, represents all 
level planar embeddings of P. □ 

Before determining the time complexity of the algorithm LEVEL-PLANAR- 
TEST, we determine the number of calls for REDUCE. Obviously, the number 
of calls for REDUCE in the first reduction phase is bounded by \V\, while the 
number of calls of REDUCE performed upon an successful INSERT operation 
is bounded by s — 1, where s denotes the number of sources of G. We show that 
at most s — 1 extra REDUCE operations are necessary if the reduced extended 
forms are merged according to their size. This is not a trivial result, as has been 
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stated by Heath and Pemmaraju (1996b). They observe that only one extra 
reduction is possible after every INSERT operation. This is only true for the 
first INSERT operation at a vertex v. If more components have to be merged, 
their observation is not true in general. 

Theorem 2. The algorithm LEVEL-PLANAR-TEST can he implemented to 
run in 0(|E|) time for any proper leveled graph G = (V, E). 

Proof. The linear time follows from an amortized analysis. We use the observa- 
tion of Di Battista and Nardelli (I1988II that in a level planar graph \E\ < 2\V\ —4 
holds. Heath and Pemmaraju (1996a, b) have shown that the overall number of 
operations that have to be performed on all calls of INSERT is bounded by 
0(|P|). The number of all operations performed on all calls of REDUCE during 
the first reduction phase is bounded by 0(|U|) which follows from a similar ar- 
gument as used by Booth and Lueker (II 97611 for the simple planarity test. The 
number of operations performed on all calls of REDUCE after a call of INSERT 
is proportional to the amount of work that has to be done in INSERT. Since the 
number of extra calls of REDUCE is bounded by one for each call of INSERT 
and the amount of work that has to be done for these extra calls is also propor- 
tional to the number of operations in INSERT, we conclude that the number of 
operations performed on all calls of REDUCE is bounded by 0(|U|). Further- 
more, we know that the total number of operations in order to update the leaves 
before merging PQ-trees is bounded by the number of operations performed in 
all calls of REDUCE. The total number of update operations that have to be 
performed after a merge phase of a level is complete is obviously bounded by 
0(|if|). Hence the total number of operations is bounded by 0(|U|). □ 

4 Remarks 

For simplicity, we restricted ourselves in this paper to the level planarity testing 
of proper leveled graphs. Of course, every non proper leveled graph can be trans- 
formed into a proper one by inserting dummy vertices. This strategy should not 
be applied since the resulting number of vertices may be quadratic in the original 
number of vertices. The following theorem shows that our level planarity test 
works on non proper leveled graphs as well as on proper leveled graphs, having 
a linear running time for both classes of leveled graphs. 

Theorem 3. The algorithm LEVEL-PLANAR-TEST tests any leveled graph 
G = (V, E) for level planarity in 0(|U|) time. 

Proof. Consider an edge e = (v,w), v G Vj, wSV), l<j</ — I<fc — I, 
traversing one or more levels. Inserting dummy vertices for e in order to construct 
a proper hierarchy would result in a graph G' such that every dummy vertex rt®, 
i G { j -I- 1, j -I- 2, . . . ,1 — 1} has exactly one incoming edge and one outgoing edge. 
However, the reduction of a PQ-tree T with respect to a set Lf with \U\ = 1 
and replacing the set by a new set Lf with \U'\ = 1 is trivial and does not 
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modify the PQ-tree. Hence we do not need to consider the dummy vertices and 
we therefore do not introduce them at all, yielding a linear level planarity test 
for general leveled graphs. □ 

An embedding of a general level planar graph G = (V, E) can be computed 
in linear time as follows: 

1. Add an extra vertex t on an extra level fc + 1 and compute a hierarchy by 
adding an outgoing edge to every sink without destroying level planarity. 

2. Add an extra vertex s on an extra level 0 and compute an st-graph by adding 
the edge (s, t) and an incoming edge to every source without destroying the 
level planarity. 

3. Compute a planar embedding using the algorithm by Chiba et al. ( II . 

4. Construct a level planar embedding from the planar embedding. 

The difficult part is to insert edges without destroying level planarity. We apply 
the following strategy. The idea is to determine the position of a sink t G Vj, 
j G {1, 2, . . . , fc — 1} by inserting an indicator as a leaf into the PQ-trees. This 
indicator is ignored throughout the application of the level planarity test and will 
be removed either with the leaves corresponding to the incoming edges of some 
vertex v G Vi, I G {j + 1, j + 2, . . . , fc}, or it can be found in the final PQ-tree. 
However, this strategy is accompanied by a set of difficult case distinctions that 
are to be discussed in another paper. Nevertheless, the time needed to compute 
a level planar embedding is bounded by 0(|C|) since the number of extra edges 
is bounded by the number of sinks and sources in G. 
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Abstract. We revoke the problem of drawing graphs in the plane so that 
only certain specified pairs of edges are allowed to cross. We overview 
some previous results and open problems, namely the connection to in- 
tersection graphs of curves in the plane. We complement these by stating 
a new conjecture and showing that its proof would solve the problem of 
algorithmic decidability of recognition of string graphs as well as realiz- 
ability of abstract topological graphs and feasible drawability of graphs 
with restricted edge crossings. 



1 Drawing Graphs When Only Some Pairs of Egdes Are 
Allowed to Cross 

Minimizing the number of crossing points in planar drawings of graphs in the 
plane is an important task. Too many crossing points - too dense a drawing - 
makes the chart badly readable for human eyes. It is well known that finding the 
exact minimum number of crossing points needed for planar drawing of a given 
graph, the so called crossing number, is an NP-hard problem pQ. On the other 
hand, this problem was deeply studied and well understood for many special 
graph classes and many exact results or at least tight bounds are known jS|. 
Much less studied is the question of drawing graphs for which only some pairs 
of egdes are allowed to cross. This natural generalization of planar graphs on 
one hand and crossing numbers on the other one was introduced in 0. Practi- 
cal motivation for this question are e.g. VLSI layouts with further restrictions 
- edges of certain types may not be allowed to cross because of their actual 
realization by connectors made from certain materials with different properties. 
Graph theoretical and complexity consequences of the concept of ‘allowing only 
certain pairs of edges to cross’ may bee even more interesting. Among them is 
the astonishing and challenging fact that so far there is no recursive algorithm 
known to decide the existence of a feasible drawing in this sense. 

In this note we want to reattract attention to the question of allowed cross- 
ings. We briefly review the definitions and give an overview of known results 
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and open problems. In the last section we report on a moderate progress in the 
question of recognition. Namely we introduce a new conjecture which, if settled 
in affirmative, would imply an exponential upper bound on the crossing number, 
and hence the existence of an (exponential but) finite recognition algorithm. 

A topological graph is a graph drawn in the plane with any number of edge 
crossings. However, some basic constraints are assumed, such as drawings of 
edges do not pass through vertices, any two edges share only a finite number of 
common points and they cross (i.e., do not touch) in the neighborhood of every 
common inner point (thus called a crossing point). No three edges pass through 
the same crossing point. A topological graph is thus a pair (G, D) where G is an 
abstract graph (i.e., G = (V, E) where ^ is a finite set of vertices and E C (^) 
is a set of edges, we consider undirected graphs without loops or multiple edges) 
and D is the drawing of G {D is a, mapping from VUE into points and Jordan 
arcs in the plane such that D{v) is the point representing vertex v in the drawing 
D and D{e) is the arc representing edge e, note that the arcs representing edges 
are considered open, i.e., their endpoints do not belong to the arcs). The drawing 
determines the set of crossing pairs of edges Rd = {{e, /} : D(e) n D{f) ^ 0}. 

The first natural question is to reverse the reasoning. A pair (G, R) is called 
an abstract topological graph (briefly an AT- graph) if G is a graph and R C (^) 
is a set of pairs of its edges. (We use this notion as only crossing pairs of edges 
are specified, but not the actual drawing.) Then it is natural to ask if G al- 
lows a drawing D such that Ro = R. Such a drawing (if it exists) is called a 
realization of {G,R). It is proved in 0 that it is NP-hard to decide if a given 
AT-graph is realizable in this sense. Though this approach has applications (e.g. 
for string graphs, see the next section), from the Graph Drawing point of view it 
seems more natural to ask whether (G, R) has a drawing D such that Ro C i?, 
i.e., whether G can be drawn in the plane so that only pairs of edges listed in 
R are allowed to cross. In the AT-graph is called weak realizable if such a 
drawing exists. Let us call such a drawing feasible in this note. Obviously not 
every AT-graph has a feasible drawing, e.g., ii R — % then (G, 0) has a feasible 
drawing if and only if G is planar. However, unlike planar graphs, it is NP-hard 
to decide if a given AT-graph allows a feasible drawing |^. But the situation is 
even more serious than NP-completeness. So far no finite algorithm is known to 
decide realizability and weak realizability of AT-graphs, and it is even known 
that straightforward strategy to place the decision problem in the class NP fails 
(see the next section) . In some sense this makes the problem theoretically more 
interesting, since there are not many graph theoretical problems with practical 
motivation which float above the class NP. (Perhaps it is the topological nature 
of the problem that causes this effect. Another problem for which no finite algo- 
rithm was known for a while, are linklessly embedabble graphs, now polynomial 
by Robertson-Seymour Graph Minor Machinery.) 
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G 3i^2 

R= {{a,5},{6,c}} 

Fig. 1. String graphs as representations of AT-graphs. 




2 Abstract Topological Graphs and Intersection Graphs 

Intersection representations of graphs (namely by planar objects) have many 
applications and there is now an extensive literature devoted to interval graphs, 
circle graphs, permutation graphs, cocomparability graphs and other intersection 
defined classes of graphs (cf. e.g. 0). String graphs (intersection graphs of curves 
in the plane) form the most general of them (they are exactly the intersection 
graphs of arc-connected sets in the plane) . String graphs are fully expressible in 
the language of AT-graphs: Given a graph G = (V, E) on n vertices, consider the 
matching nK 2 and give its edges the names of the vertices of G. Then G a is an 
intersection graph of curves if and only if the AT-graph (nAT 2 , E) is realizable. In 
fact, it is shown in |3j that recognizing string graphs is NP-hard, via a reduction 
from realizability of AT-graphs. Similarly as in the case of realizability of AT- 
graphs, no finite algorithm for recognition of string graphs is known. 

3 Crossing Number of AT-Graphs 

Definition 31 The crossing number (denoted by crat(G, R)) of an AT-graph 
(G, R) is the minimum possible number of crossing points in a feasible drawing 
of {G, R). We denote by crat (jn) the maximum of CTat (G, R) taken over the class 
of all feasibly drawable AT-graphs (G, R) with G having at most m edges. 

As a special case, crat{G, R) = cr(G) if i? = ('^ 2 '^^) allows any pair of edges 
to cross (here cr{G) denotes the traditional unrestricted crossing number of a 
graph G, where the crossing number is the minimum total number of crossing 
points in a planar drawing). Note that crati’tn) is finite for every to, since the 
maximum is taken over a finite set of AT-graphs. No upper bound, is however, 
known: 

Problem 1 Is there a recursive upper bound for cvat (m) ? 
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Fig. 2. Reducing the number of crossing poiirts iir air AT-graph realization. 



Note that proving any recursive upper bound would imply the existence of 
a finite recognition algorithm. One would just try all possible placements of 
crossing points on the edges of G and test planarity. 

The same argument applies to AT-graph realizability as well: Given a realiz- 
able AT-graph (G, R) with m edges, fix a realization and for any pair {e, /} G R, 
choose one crossing point P^f of the drawings of the edges e and /. Regard the 
sections of the edges of G between the marked crossing points (and between 
marked crossing points and vertices of G) as edges of a new graph G' (with 
vertex set V{G) U {Pef\{e, /} G i?}). Call this drawing D'. Now G' has at most 
m? edges (every edge of G has at most m — 1 marked crossing points with other 
edges), and hence crat{G',R' = Rd') < cratirn?)- Every feasible drawing of 
(G', R') then determines a drawing of G in which edges e, / share the point Pef, 
whenever {e, /} G R. Some of these points may have become touching points, so 
after overlaping the curves near such points (creating two crossing points from 
one touch point) we get a full realization of (G, R) with at most 2crat{rn?) cross- 
ing points (cf. Figure El for illustration). Now the same argument as for feasible 
drawability applies - guess orderiirgs of the crossing points along the edges and 



242 



Jan Kratochvfl 



test planarity. Noting that string graphs are special types of AT-graphs, we have 
the following: 



Observation 32 An affirmative answer to Problem^ would guarantee the exis- 
tence of finite algorithms for recognizing string graphs, realizability of AT-graphs 
and weak realizability of AT-graphs. 



One may ask if AT-graphs can force large crossing numbers. Yes, they can. 
Here comes perhaps the first real difference between the crossing numbers of 
graphs and AT-graphs. While in every optimal (unrestricted) drawing of a 
graph every two edges share at most one crossing point (folklore), consider 
the example in Figure 0 The only pairs of edges that are allowed to cross are 
{a, 5}, {o, c}, {a, d}, {a, e}, {6, /}, {6, g}, {b, h}. The frame G—{a, b} has a unique 
planar embedding and thus the drawing in the figure is the only feasible draw- 
ing of the AT-graph (from topological point of view). The edges a,b share at 
least 2 crossing points, and besides that, they also share a common vertex. This 
is another difference from ordinary crossing number - in every optimal (unre- 
stricted) drawing of a graph no two edges incident with the same vertex cross 
(again folklore). 




Fig. 3. AT-graph requiring double crossing. 



It is somewhat surprising that there exist AT-graphs which require exponen- 
tial number of crossing points: 



Proposition 33 jS] For every large enough m, crat{m) > 2e . 

One can read this result as follows: The first natural strategy to place the 
recognition problem of weakly realizable AT-graphs in the class NP, namely 
guessing the crossing points of the edges and checking planarity, necessarily 
fails, as there graphs which would require exponentially large underlying graph 
of the realization, and planarity checking would not be polynomial in the size of 
the actual input. (This of course does not prove that the recognition is not in 
NP.) It is conjectured in |5] that cTatfm) < for some contstant c. 
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4 Is There Always a Single Crossing? 

In this section we introduce a conjecture which, if true, would imply a single 
exponential upper bound for crat(m), thereby answering Problem Hand proving 
the conjecture of p|. 

Conjecture 2 In any optimal feasible drawing of an AT-graph, any edge which 
is crossed by at least one other edge is crossed by some edge exactly once. 

In a feasible drawing D of G, let ci)(e) denote the number of edges that cross 
D{e). 

Lemma 41 Assume Conjecture]^ is true. Let D be an optimal feasible drawing 
of an AT-graph (G,R). Then every edge e shares at most — 1 crossing 

points with other edges. 





Fig. 4. 



Proof: The proof goes by induction on the number of crossing points in D, 
i.e., on crat{G, R). The statement is obviously true if e is not crossed by other 
edges, i.e., C£>(e) = 0. Suppose C£)(e) > 1. By Conjecture 0, there is an edge, say 
f = rs, which shares exactly one crossing point with e. Let P be the crossing 
point of D(e) and D{f). Create a new graph G' with vertex set 

V' = V{G) U {P} 



and edge set 



E' = {E{G) - {e, /}) U {uP, vP, rP, sP}. 
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We let G' resume the drawing D of G, i.e., let D' be the drawing physically 
identical to D, such that D'{x) = D{x) for every vertex x G V{G), D'{h) — D{h) 
for every edge h G E{G) — {e, /}, D'{P) = P and the arc D{u)P on D{e) is 
regarded as D'{uP), the arc D{v)P on D{e) is regarded as D'{vP) and similarly 
for f = rs. We define R' = Rd' as the set of pairs of edges actually crossing 
in D' . The drawing D' has one crossing less than D, i.e., crntiG.R) — 1. Hence 

Suppose {G',R') has a better feasible drawing, i.e., with less crossing points. 
Such a drawing would yield a feasible drawing of (G, R) (setting D{e) = D'{uP)U 
{P} U D'{vP) and D{f) = D'{rP) U {P} U D'{sP), no restrictions of R are 
violated and D has at most one crossing point - P - more than D'). Since D was 
an optimal feasible drawing of (G,R), this is not possible. Hence crat{G' ,R') = 
crat(G,R) - 1. 

It follows that D' is an optimal feasible drawing of (G', R') with less crossing 
points than D, and we may apply induction hypothesis. Note that cd'{uP) < 
CD'(e) — 1 and cd'{vP) < CD'(e) — 1 (since neither / nor its descendants rP, sP 
cross e and in D', e is allowed to cross only those edges that e crossed in D). By 
induction hypothesis, D'(uP) shares at most 

2‘^d'GP) _ 1 < 2'=ri(e)-l _ 2 ^ 

crossing points with other edges, and D'{vP) shares at most 

2Cjy,{vP) _ ^ ^ _ 1 

crossing points with other edges. Going back to P, we see that D{e) shares at 
most 

2‘^d'Gp) — 1 -(- — 1 + 1 < 2 • (2°°^®)“^ — 1) + 1 = 2®^*-®^ — 1 

crossing points with other edges. □ 

Corollary 42 Suppose Conjecture m is true. Then cratijn) < ^(2"^ ^ — !)• 

Proof: Let D be an optimal feasible drawing of an AT-graph (G,R). Each of 
the m edges is crossed by at most m — 1 other edges, and hence each edge shares 
at most 2"^“^ — 1 crossing points with other edges (by Lemma This is in 
total — 1), each being counted twice. □ 



5 Conclusion 

We believe that the question of minimizing the number of crossing points in fea- 
sible drawings of AT-graphs is interesting and worth further consideration. The 
main interest in author’s eyes lies in Conjecture El which would settle the annoy- 
ing fact that so far no finite algorithm for recognition of string graphs (nor of 
realizable AT-graphs) is known. Apart of this conjecture, determining bounds for 
crossing number of particular families of AT-graphs may be of separate interest. 
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Abstract. The paper presents self-organizing graphs, a novel approach 
to graph layout based on a competitive learning algorithm. This method 
is an extension of self-organization strategies known from unsupervised 
neural networks, namely from Kohonen’s self-organizing map. Its main 
advantage is that it is very flexibly adaptable to arbitrary types of visual- 
ization spaces, for it is explicitly parameterized by a metric model of the 
layout space. Yet the method consumes comparatively little computa- 
tional resources and does not need any heavy-duty preprocessing. Unlike 
with other stochastic layout algorithms, not even the costly repeated 
evaluation of an objective function is required. To our knowledge this is 
the first connectionist approach to graph layout. The paper presents ap- 
plications to 2D-layout as well as to 3D-layout and to layout in arbitrary 
metric spaces, such as networks on spherical surfaces. 



1 Introduction 

Despite the fact that large efforts have been devoted to the construction of graph 
layout tools, their usability in practical applications is still relatively limited for 
large graphs and in the case of non-standard layout requirements. Two impor- 
tant issues that have to be addressed are flexibility and speed. The dilemma is 
that fast algorithms, such as Sugiyama layout ^], are usually highly specialized 
and tailored for a particular domain. On the other hand, more flexible declar- 
ative layout methods, in particular simulated annealing and other general 
stochastic optimization methods, are computationally very expensive p]. The 
situation becomes worse if the evaluation of the cost function is expensive, such 
as checking the (potentially quadratic) number of edge crossings, because it has 
to be evaluated on every iteration. Especially genetic algorithms can exhibit a 
very problematic performance m In real-world tasks, such as re-engineering, 
graph sizes can easily reach more than 100000 nodes. In such cases speed is of 
prime importance even if a sub-optimal layout has to be accepted. 

This paper introduces a flexible new layout method called ISOM layout that 
in comparison with other stochastic techniques consumes only little computa- 
tional resources. No heavy-duty preprocessing and no costly repeated evaluation 
of an objective function are required. One of the method’s major advantages is 
its extreme versatility in regard to the visualization space used. The algorithm 
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is explicitly parameterized with a metric of the layout space and there is no 
limitation on the metric that can be used. It is therefore directly useable for 2D 
and 3D-graph layout as well as for non-standard layouts, for example in non- 
rectangular viewing areas. Even specialized layout tasks like embedding a net- 
work into a spherical surface can directly be solved as we will demonstrate. The 
method presented is based on a competitive learning algorithm which is derived 
from well-known self-organization strategies of unsupervised neural networks, 
namely from Kohonen’s self-organizing maps dlTlIlOl- To our knowledge this 
is the first connectionist approach to graph layout. 

At a first glance, a number of arguments are apparently speaking against 
the application of NN to problems such as graph layout: It is difficult to han- 
dle symbolic relations with NN and structures of potentially unlimited size are 
not easily accommodated in a NN. Some of these problems have recently been 
addressed by the neural folding architecture HH and by adaptive structure pro- 
cessing jO). However, our approach uses an entirely different way to overcome 
these limitations: We will not use an external network structure “to learn the 
graph”, instead the graph itself will be turned into a learning network. 

The central problem in graph layout is that it requires to solve computation- 
ally hard global optimization problems. As several excellent solutions for other 
computationally hard optimization tasks prove, optimization is one of the partic- 
ular strengths of NN. Prominent examples are the travelling salesman problem 
or graph-theoretic problems like optimal bipartitioning m- It therefore seems 
promising to study NN for graph layout. 

The main advantage of using a NN method for optimization problems is that 
we do not have to construct a suitable heuristics by hand. Instead, the network 
discovers the search heuristics automatically or — putting it less mysteriously — a 
meta-heuristics is built into the learning algorithm of the network. As a conse- 
quence, the implementation of the method is very simple. 

2 Kohonen’s Self-Organizing Maps 

The model of self-organizing graphs which we are going to present is an exten- 
sion of a well-established neural network type, namely Kohonen’s self-organizing 
maps (SOM), which are a kind of unsupervised competitive network. We will 
therefore have to briefly review the basic idea of competitive learning before 
we can introduce self-organizing graphs. A more general introduction to neural 
networks is beyond the scope of this paper. The interested reader is referred to 
m and P as well as Kohonen’s books dEO]- 

2.1 Competitive Learning 

There are two different types of learning for NN: Supervised and unsupervised 
learning. Supervised learning requires an a-priori defined learning objective and 
a “teacher” external to the network. In the learning phase this teacher (or super- 
vision procedure) judges how close the network’s response is to the intended solu- 
tion and makes appropriate adjustments to the network’s connection strengths 
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(weights) in order to tune it towards the correct response. Once the learning 
phase is finished, this enables the network to perform according to the prede- 
fined objective function. 

In unsupervised learning there is no teacher and no a-priori objective func- 
tion: The net has to discover the optimization criteria itself. This, of course, 
means that a particular network type can only perform well for a particular kind 
of task. Typical application areas of unsupervised learning are: clustering, auto- 
association, content-based retrieval, encoding, compression, and feature map- 
ping. The best-known NN models of unsupervised learning are Hebbian learning 
j'l Hlj and the models of competitive learning: The adaptive resonance theory HSI, 
and the self-organizing map or Kohonen network which will be explained in the 
following. Some discussion of the usage of unsupervised learning for visualization 
tasks can be found in | 23 |. In the case of graph layout, using an unsupervised 
learning method means that we will not teach the layout aesthetics to the net- 
work. Instead we will let the net discover the appropriate criteria itself. 

The basic idea of competitive learning is that a number of output units 
compete for being the “winner” for a given input signal. This winner is the 
unit to be adapted such that it responds even better to this signal. In hard 
competitive a.k.a. “winner-take-all” learning only a single unit is adapted. In 
contrast, soft learning adapts several units at once. In a NN typically the unit 
with the highest response is selected as the winner. 

The learning process therefore requires to elect a winner and to apply a 
selective weight adjustment in every learning cycle. This clearly sounds as if 
some kind of supervision procedure was needed and as if we were discussing 
a supervised learning scheme. Later we will indeed simplify (and accelerate) 
the learning method by using an external supervision procedure, but the same 
result can also be achieved with unsupervised learning. The key to managing 
the selection of the winner as well as a selective update without supervision is 
to use lateral and recursive network connections. 

Most unsupervised competitive networks have a rather simple structure and 
since we are aiming at Kohonen networks, we will only discuss networks that 
consist of a single input layer and a single output layer (the so-called com- 
petitive layer) here. The competitive net is wired such that each unit in the 
competitive layer is connected to every input unit. Additionally each compet- 
itive unit is connected to itself via an excitatory (positive) connection and it 
is connected to all other units in the competitive layer via inhibitory (neg- 
ative) connections (see Figure QI). As usual, the response of a unit is calcu- 
lated as a (usually sigmoidal) output function a of the net input to this unit, 
i.e. as a function of the sum of all signals received from the inputs xi, . . . ,Xfc 
and via the lateral connections weighted by the respective connection strengths: 

update is made repeatedly the net- 
work will exhibit a behaviour called competitive dynamics: Because of the lateral 
connections the node which initially has the greatest response to the input in- 
creases its own activation via self-excitation at the same time strongly inhibiting 
the other units via the inhibitory lateral connections. With a suitable choice of 
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*1 X2 X3 X4 



Fig. 1. Competitive Layer Schematics 





Fig. 2. Updates in Weight-Space 



the activation function a and the lateral weights, the network will for any given 
input eventually settle into an equilibrium where only the single winner unit Uj 
is active and all other nodes have their activation reduced to zero ^21 • 

It remains to be shown how a selective weight update can be performed 
without supervision. But since in the equilibrium state of an ideal competitive 
net only the single winner unit Uj is active (i.e. has a response Vj > 0), the 
update can just be enforced for every unit across the entire network provided it 
is weighted by the corresponding unit’s response. In this way only the weights 
belonging to the winner unit will be updated. The entire competitive learning 
procedure can thus be formulated in the following way: (1) present an input 
vector i to the net and let the net settle into equilibrium, (2) for every node 
Uj enforce a weight correction ^Wj = arj (i — Wj) where a is suitable learning 
factor. 

So far we have regarded the units’ responses as the desired output, but for 
our purposes it is much more interesting to switch to a different perspective and 
to look at the weights of the competitive units instead. In fact, from now on 
we will ignore the actual responses of the units altogether. Eventually we will 
transform our layout task into a problem in weight-space and solve it entirely 
there. Each weight set for a competitive unit Uj can be represented by a vec- 
tor Wj = (jWj^i, . . . ,Wj^n)- If we assume that input vectors and weight vectors 
are normalized according to || Wj ||= = 1 then these vectors can be 

represented by arrows to the surface of an n-dimensional unit hypersphere. In 
weight-space a learning step can now be interpreted as turning the weight vector 
of the winning unit towards the current input vector. Starting from an initial 
random distribution of the weight vectors, the network will therefore attempt 
to align its weight vectors with the input vectors it is seeing. In this way it ob- 
viously solves an instance of a clustering task: The weight vectors are clustered 
with the input vectors. Figure 0 shows an example: Starting from the initial ran- 
dom configuration on the left side the network moves towards the configuration 
on the right side of the same figure. 

2.2 The Kohonen Network 

If we switch from hard competitive learning to soft learning, several units may 
be adapted at once. The question is, which units shall be chosen for an update? 
If the network has some known spatial arrangement, one of the possibilities is 
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to update the winner together with its neighboring nodes. This is the learning 
algorithm used by Kohonen’s self-organizing maps. In real biological neural net- 
works the spatial arrangement is in fact important and not only the strength 
but also the location of a neural excitation conveys information. In order to 
understand the importance of the spatial organization we need to have some 
suitable metric for the sensory signals such that we can judge the proximity 
of two input signals. If we know the network’s geometric arrangement (i.e. the 
spatial locations of the individual units) such an input metric enables us to an- 
alyze the relationship between proximity of input signals and spatial proximity 
of the resulting network excitation. Surprisingly, in the mammal brain it is of- 
ten the case that spatially close regions of cells respond to input stimuli that 
are in proximity. Such mappings of metric regions of the input space to spatial 
regions of the brain (or, more abstractly, metric regions of the output space) 
are called topology preserving feature maps or topographic maps. Striking exam- 
ples of such topographic maps in the mammal brain are the retinotopic map, 
where close regions of the retina are mapped to close regions of the cortex, and 
the somatosensory map, where close regions of the body surface are mapped 
to close regions of the somatosensory cortex. These are both spatial metrics, 
but examples of more abstract metrics can also be found. The tonotopic map 
from the ear to the auditory cortex, for example, works such that spatially close 
cells correspond to hearing similar frequencies. In fact, the seminal study in the 
field ^21 established this kind of abstract mapping for the orientation receptor 
cells which react to specific orientations of visual stimuli (such as grids). Their 
spatial arrangement is such that cells located in proximity correspond to similar 
angles of stimuli. Since topographic maps are a common phenomenon, some self- 
organization mechanism that automatically performs the corresponding neural 
“wiring” is likely to exist. Von der Mahlsburg m succeeded first in showing that 
competitive learning can achieve this. Kohonen later extended and simplified the 
model m dODI, casting it into the computationally more adequate form of 
the so-called self-organizing map (SOM). 

Kohonen’s networks use two relatively simple spatial configurations: They 
are either rectangular or hexagonal grids implying an 8-neighborhood or a 6- 
neighborhood, respectively. The network structure again is a single layer of out- 
put units without lateral connections and a layer of n input units in which each 
output unit is connected to each input unit. 

Since Kohonen networks are a computational method we can sacrifice the 
biological justification and simplify the situation by using an external supervision 
process to select the winner and to update its weights. In this case we do not 
need any lateral connections and we do not need to wait for the network to 
settle into a stable state so that the network response can be computed much 
faster, namely in constant time. In this way Kohonen’s learning procedure can 
be formulated as: 

1. present a stimulus vector v to the network, 

2. find the unit Uj with the largest response Vj, 
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3. adapt the weights of Uj and all nodes in a neighborhood of a certain radius 
r, according to the function Awi = r]{t) A{uj,Ui) {v — Wi). 

4. After every A:-th stimulus decrease the radius r. 

rj{t) is a time-dependent adaption factor and A(uj^ ut) is a neighborhood function 
the value of which decreases with increasing distance between Ui and Uj. Thus 
the winner is adapted strongly whereas the influence of the input diminishes 
with increasing distance from the winning unit. This process is iterated until the 
learning rate rj{t) falls below a certain threshold. 

For the selection of the winner unit it is, in fact, not at all necessary to 
compute the units’ responses. As Kohonen shows, the winner unit Uj can as well 
be taken to be the one with the smallest distance || v — wj || to the stimulus 
vector, i.e. j = argmirii^^i^ || v — Wi |j. Both criteria turn out to be identical 
for normalized vectors. 

We can think of the adaption as being determined by a “cooling” param- 
eter rj, which decreases the adaption with increasing training time, a “decay” 
parameter A, which decreases the adaption with increasing distance from the 
winning unit, and a “narrowing” parameter k which decreases the spatial extent 
of adaption over time. Kohonen demonstrates impressively that for a suitable 
choice of the learning parameters the output network organizes itself as a topo- 
graphic map of the input. Various forms are possible for the parameter functions, 
but negative exponential functions produce the best results, the intuition being 
that a coarse organization of the network is quickly achieved in early phases, 
whereas a localized fine organization is performed more slowly in later phases. 
Therefore common choices are: A{ui,Uj) = ^nd r]{t) = 

where d(ui, Uj) is the topological distance of Ui and Uj and cr is a time-dependent 
width factor of the form a{t) = aat~°‘ . 

3 Prom Self-Organizing Maps to Self-Organizing Graphs 

We have mentioned above that the first key to solving the layout task is to 
look at the network’s behaviour in weight-space instead of at its responses. If 
we visualize the behaviour of the SOM in weight-space, the connection to graph 
layout will immediately become clear. Restricting the input to two dimensions, 
each weight vector can naturally be interpreted as a position in 2D-space. 

Figure 01 illustrates 
the learning process of a 
SOM with 9 units in a 
rectangular grid. The 4- 
neighborhood is depicted 
by straight lines. The ini- 
tial random weight distri- 
bution (left) eventually 
settles into an organized 
topographic map (right). 

Each unit has moved to one of the stimulus positions marked by black dots. 





Fig. 3. Learning of a Topographic Map 
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If we interpret the network’s weight-space in Figure 0 as the embedding of 
a grid-like graph, the connection of SOM learning to graph layout immediately 
becomes obvious: In its self-organization process the SOM has obviously created 
a reasonable, if not perfect, layout for its network graph. This is not incidental, 
since the optimization criteria for learning a topographic mapping and for per- 
forming a graph layout are quite similar under a suitable transformation of the 
layout problem. 

As mentioned, the first key is to look at the weight-space instead of at the 
output response and to interpret the weight-space as a spatial embedding of the 
graph. Abstractly speaking, the SOM constructs a metric-preserving mapping 
from the m-dimensional weight-space onto the n-dimensional input space. Let us 
inspect this more closely and have a look at various ways to intuitively interpret 
the way the SOM works: To cite m “••• we can think of a sort of elastic net in 
input space that wants to come as close as possible to the inputs; the net has the 
topology of the output array (i.e. a line or a plane) and the points of the net have 
the weights as coordinates.” In other words, the learning process “stretches” the 
network such that its nodes cluster with the input positions while at the same 
time matching the topology of the network with the metric of the input space. 

Clearly, this is quite similar to the task that we have to solve for graph 
layout: There we have to find an embedding of the nodes such that the Euclidean 
distance of a pair of nodes matches their graph-theoretic distance. The main 
difference is that in the case of the SOM we have only dealt with very simple, 
fixed 2D-network topologies, namely rectangular or hexagonal grids, whereas in 
graph layout we must handle arbitrary topologies which are different for every 
new task. The second key therefore is to realize that there are no restrictions on 
the topology that we can give to the SOM’s network. The learning process will 
always attempt to construct a metric preserving mapping between the input 
space and the network topology. In fact, recent models of competitive neural 
networks for other problem domains, such as the growing neural gas jE], are 
also using different topologies or even topologies that evolve during the training 
process. The idea therefore is to train a competitive network that has the same 
topology as the graph to be laid out. Now each unit of the network can be identified 
with a node of the graph and each unit’s weight vector can be interpreted as 
the spatial embedding of this node. Analogously to the SOM, we expect such 
a network to settle into a configuration where the nodes are clustered with the 
input positions and where their Euclidean distances match their graph-theoretic 
distances. 

Note that a hidden shift of perspective was the third key to the solution: 
Instead of training the network to compute a certain input-output relation we 
are regarding the training phase as the intended computation. The network is 
never actually used once it is trained. 

One question remains to be answered: Since the network clusters its weight 
vectors with the input stimuli it is seeing, the set of input vectors clearly de- 
termines the final layout. As can been seen in Figure 0 each node of the graph 
will move towards some stimulus position. So how can a reasonable set of input 
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stimuli be obtained? The solution to this problem is surprisingly simple: A set 
of points that is distributed uniformly in the input area is used as the set of input 
stimuli. Using a uniform distribution has the important property that it causes 
the net to stretch such that it uniformly fills the available input space (up to 
a small border zone that remains unused). Of course, it also makes the set of 
input stimuli independent from the actual graph to be laid out. 



3.1 The ISOM Layout Algorithm 

We are now set to detail the layout algorithm outlined above. The main differ- 
ences to the original SOM are not so much to be sought in the actual process 
of computation as in the interpretation of input and output. First, the problem 
input given to our method is the network topology and not the set of stim- 
uli. The stimuli themselves are no longer part of the problem description but 
a fixed part of the algorithm. Secondly, we are interpreting the weight-space as 
the output parameter. The actual network output is discarded completely. As 
a consequence, there is no activation function cr. Because of this inverted per- 
spective the method is termed the inverted self-organizing map (ISOM). Apart 
from these changes, we are using slightly different cooling and decay parameters 
which have proven useful in experiments. 

Algorithm ISOM 

input: a graph G = (V,E), output: a spatial embedding of G 
epoch t := 1; 

radius r := Umax] /* initial radius */ 
cooling factor c; 

forall V G V do v.pos := random-vectorf); 
while (t < tmax) do { 

adaption a:= max{min-adaption, • max -adaption) 

i := random-vectorf)] /* uniformly distributed in input area */ 
w := V G V such that || v.pos — i |j is minimal 
for w and all successors Wk of w with d{w, Wk) < r do 
Wk-pos := Wk-pos — a {wk.pos — i); 

t := t -|- 1; 

if t mod interval = 0 and r > min-radius do r := r — 1; 

} end. 

Note that the node positions Wk-pos which take the role of the weights in 
the SOM are given by vectors so that the corresponding operations are vector 
operations. Also note the presence of a few extra parameters such as the minimal 
and maximal adaption, the minimal and initial radius, the cooling factor, and the 
maximum number of iterations. Suitable values have to be found experimentally. 




254 Bernd Meyer 



3.2 Comparison to Force-Directed Layout 

We will now show that the objective layout criteria implicitly used by the ISOM 
are closely related to those that are used in the well-established group of force- 
directed methods 00 

Force-directed layout is minimizing the energy in the edges which are un- 
derstood as springs attached to the nodes. These springs are used to model 
attraction forces and repellent forces between neighboring nodes. Computing 
a force-directed layout means to find a configuration where these forces are in 
balance. The ISOM, on the other hand, is optimizing the embedding for the 
following two objectives: (1) the graph-theoretic distance of all node pairs is 
matched with their metric distance and (2) a uniform space filling distribution 
of nodes is generated. We can understand (1) as the analogue of attractional 
forces, since neighboring nodes move towards the same stimulus positions, and 
(2) as the analogue of repellent forces, since the nodes are trying to drift apart 
in order to fill the space. 

Intuitively the objectives of the ISOM and of basic force-directed layout are 
closely related and with both methods symmetries are automatically exhibited. 
One of the main differences is that the ISOM achieves the approximation of 
the implied computationally hard optimization problem as the byproduct of 
a stochastic self-organization process. This eliminates the need to find a good 
heuristic procedure for computing the equilibrium of forces and, in fact, the 
ISOM procedure is potentially faster. 

The computation performed in a single iteration is inexpensive. The winner 
can be found in linear time (or in logarithmic time, if suitable spatial index 
structures are used). If the graph is not dense, only a constant number of suc- 
cessor nodes will be updated due to the limited radius. Each of these nodes can 
be accessed in constant time from its predecessor and the correction factor can 
also be computed in constant time. Thus, for a graph of bounded degree if the 
number of epochs and the initial radius is considered as fixed, the entire layout 
computation can be done in linear time. Even if we include the required number 
of iterations as a dependent variable in the complexity calculation, the overall 
complexity appears to be at most quadratic, since the experimental results in- 
dicate that it is sufficient to use a number of iterations that is linear in the size 
of the graph. 

In contrast to force directed-models, the ISOM is easily adapted to any kind 
of layout space, since it can be explicitly parameterized with the metric of this 
space (see Section 0). On the other hand, some additional layout objectives, 
such as orthogonal drawings, are more easily introduced in force-directed models, 
since these can be modelled explicitly by extra forces m- 

4 Experimental Evaluation 

Let us now give experimental results and some small examples, space not per- 
mitting more. We have implemented a Java applet which is available for further 
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exploration at http://www.bounce.to/Bernd.Meyer. The experiments confirm 
our theoretical expectations and show that the ISOM converges unexpectedly 
fast towards reasonable layouts. For medium-sized graphs (of up to approxi- 
mately 25 nodes) typically not more than 500 epochs are required to produce 
a nice layout. The basic structural organization of the graph happens very fast 
in the early phases of the computation while later phases with small adaptions 
and radiuses mainly serve to refine the layout. 

The choice of parameters can be important, but the ISOM seems fairly robust 
against small parameter changes and usually quickly settles into one of a few 
stable configurations. As a rule of thumb for medium-sized graphs, 500 epochs 
with a cooling factor c = 0.4 yield good results. The initial radius obviously 
depends on the size and connectivity of the graph. Tmax=^ with an initial adap- 
tion of 0.8 was used for the examples. The interval for radius decrease has to be 
chosen such that most of the adaption happens while r = 1 (with adaptions of, 
say, 0.4 . . . 0.15). The final phase with r = 0 should only use very small adap- 
tion factors (approximately below 0.15) and can often be dropped altogether. 
A long phase for r = 0 with too high adaption factors is not advisable, since 
the symmetry of the layout may be destroyed. This is not surprising, since we 
know that a one-dimensional SOM will eventually be arranged as a space-filling 
peano curve, if phase r = 0 is active for too long. In fact, it is fairly obvious 
that the SOM algorithm for r = 0 reduces to mere vector quantization, since the 
topological structure of the network is no longer taken into regard. 

As a first small extension to 
the basic model we can use dif- 
ferent layout areas, such as non- 
rectangular regions. This is sim- 
ply done by choosing a distribu- 
tion of input stimuli that is uni- 
formly distributed in this region. 

The only restriction on the shape 
of this area is that it must be con- 
vex, because otherwise edges might 
shortcut across regions where the 
layout area is caving in and even 
nodes might move outside of the 
layout area. We observe that using 
different layout areas leads to dif- 
ferent layouts which are reasonably 
adapted to the available area. For 
example the layout of the complete 
graph Kq in Figure El was gener- 
ated with a triangular distribution, 

while Figure 0 was generated with Fig. 4. The ISOM Applet at Work 
a rectangular distribution. 
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Free trees can be handled as Figure El illustrates, but a drawback is that 
there is no straightforward possibility to draw a rooted tree according to the 
usual layout conventions with the basic algorithm. 

5 Self-Organizing Graphs in 3D and on Spheres 

An obvious extension that comes to mind is to use the same method for 3D- 
layout. The changes are straightforward: All that is required is to use 3D-vectors 
for input stimuli and weights. Apart form this, the algorithm remains unchanged. 
In most cases a reasonable 3D-structure is obtained (FigureES)- It is interesting 
to compare a 3D-layout with a layout of the same graph in 2D-space. 

For a cube, for example, the 2D-ISOM 
generates exactly a 2D-projection of the 
layout generated by the 3D-ISOM (Fig- 
ure 0 right) instead of the planar 2D- 
layout which would also be possible (Fig- 
ureEl left). This is because the non-planar Fig. 5. 3D-Structures in 2D-Space 
layout conforms better to the criterion of 
uniform edge lengths. 

Despite the fact that many layout methods give preference to planar layouts, 
this example may serve to illustrate that a planar layout is not always superior 
to a non-planar one. The important property of a good layout is to make the 
structure of the graph plainly recognizable. Depending on what the meaning of 
the graph is, this may sometimes even better be done with a non-planar layout. 

Though the 3D-structure of a graph may already become apparent in a 2D- 
layout, a real 3D-layout is often required for more complex structures. We have 
implemented a prototype simulator for self-organizing 3D-graphs in Mathemat- 
ic a 123 which allows to look at the 3D-graph from different viewpoints or to 
generate movies that show a 3D-layout rotating in 3D-space. Experience shows 
that such possibilities are often required to fully recognize the 3D-structure on a 
2D-display. While the above suggests that the ISOM works well in 3D-space, a 
critical assessment is in place. It is not really clear whether the optimization of 
layout objectives derived for 2D-space always leads to good 3D-drawings. This 
has also been observed by other authors El- fact, the aesthetic criteria gov- 
erning 3D-layout are not well-understood, since we are dealing with a different 
type of visual perception in 3D. In particular, we tend to interpret closed paths 
as surfaces. If the structure of the graph is already known, automatically gener- 
ated 3D-layouts can deviate quite far from the expectation. On the positive side 
they often reveal additional structural properties (in particular symmetries) that 
would not be recognized in the preconceived layout. This can be illustrated with 
the layout in Figure which could also be drawn as a cube inside of another 
cube with corresponding corners of the inner and outer cube connected. 
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Fig. 6. The Complete Graph Kq Fig. 7. A Rectangular Layout of Kq 





Fig. 8. A Layout of a Free Tree 



Fig. 9. Another Example 
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The ISOM is easily exten- 
sible to other special layout 
spaces. In particular, there are 
several interesting applications 
of layout on arbitrary spheri- 
cal surfaces: (1) It can serve to 
utilize the display space more 
efficiently. A generalized fish- 
eye view, for example, is but 
the projection of a uniform lay- 
out on some spherical surface. 

Real spherical 3D-layout opens 
even more interesting possibil- 
ities: A graph can, for exam- 
ple, be displayed on the surface 
of a globe that can be inter- 
actively rotated. This combines 
the space utilization of fisheye 
views with a novel interaction 
mode for graph exploration. (2) 

A layout on a spherical surface 
often provides a good alternative to non-planar straight line drawings without 
burdening the user with the entire complexity of understanding a 3D-layout on 
a 2D-display device (see Fig, ll 2ll . (3) Finally, a spherical layout may simply be 
part of the defined problem, such as a visualization of a world-wide network on 
a globe. 

The so far implicit parameterization of the algorithm with the metric of the 
layout space is easily made explicit. The distance between Ui and uj must now be 
defined as the length of the shortest curve on the surface that connects Ui and Uj 
and the adaption must be modified such that the node to be adapted moves along 
this curve by the appropriate amount. In most cases it is easy to define the curve 
connecting Ui and Uj in parametric form: r(t) = {x(t),y(t), z(t)} for 0 < f < I. 
Let Wi = r(ti) and Wj = r{t 2 ). The length of the curve connecting Ui and Uj is 
then simply given by 




Whenever this integral can be expressed in closed form (or numerically aprox- 
imated), the generalized ISOM below can be used for the particular surface. 
For a unit hypersphere the formulas become particularly simple. Since all 
shortest connections between two points Ui and Uj now lie on greater cir- 
cles and the length of the connecting arc on a unit hypersphere is equal to 
the cosine of the associated central angle, the distance formula is reduced to: 
s{wi,Wj) = cos = Wi ■ Wj The updated position thus is simply 

given by the normalized vector ||^ with r := Wk-pos — a {wk-pos — i). 
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Algorithm ISOM-generalized 

input: a graph G = (V,E), output: a spatial embedding of G 
epoch t := 1; 

radius r := Tmax] /* initial radius */ 
cooling factor c; 

forall V & V do v.pos := random-vectorQ; 
while (t < tmax) do { 

adaption a:= max{min-adaption, • max -adaption) 

i := random-vector{); /* uniformly distributed on layout surface */ 
w := V G V such that s{v.pos,i) is minimal 
for w and all successors Wk of w with d{w, Wk) < r do 
Let r(ti) = Wk-pos; 

Wk-pos := r{t 2 ) such that = 1 - a; 

t '.= t 

if t mod interval = 0 and r > min-radius do r := r — 1; 

} end. 

A prototype version of self-organizing spherical graphs has also been imple- 
mented in Mathematica. Figure El shows one viewpoint of the spherical layout 
of a hexagonal cylinder as an example. 

6 Extensions 

A problem that can occur in some layouts are “collapses” or “clashes” in which 
two (connected or unconnected) nodes are moving towards the same position. 
Theoretically three types of clashes could occur: edge-edge clashes, node-edge 
clashes, or node-node clashes. Node-node clashes can be avoided by (1) choosing 
a larger layout area and (2) choosing different cooling factors or more epochs such 
that the final phase with r = 0 is extended and the node distribution becomes 
more “space filling”. An alternative way to deal with node-clashes is a post- 
process which zooms in on the clashing cluster and generates a new local layout 
by applying ISOM layout only to the nodes in the zoomed area. Nodes outside 
of this area are ignored and remain unchanged during this post-process. Both 
solutions are not completely satisfying, primarily because they require interven- 
tion from the user. Also some structures are notorious for letting unconnected 
or unrelated nodes move towards the same position, and an alternative layout 
in which these nodes are located in entirely different places may be preferable 
depending on the context (think back to the example in Figure [HJ- Choosing 
different parameters does separate clashing nodes. It does not, however, find an 
entirely different structure. We are currently investigating the use of different 
types of decay functions A in order to achieve the desired effect. Particularly 
promising seems the usage of the so-called Mexican hat distribution (the solid 
curve in Figure El) instead of the standard Gaussian distribution (the dashed 
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curve). Because the Mexican hat function falls below zero above a certain dis- 
tance, it can be used to simulate a force that pushes unrelated nodes apart (i.e. 
nodes with a large topological distance). Of course, the usage of the clipping 
radius must be modified accordingly. 

As for edge-clashes, since every edge-edge clash implies a node-edge clash, it 
is sufficient to eliminate the latter type. A simple way to achieve this is to use 
a post-process that substitutes each edge running through a node to which it is 
not adjacent by a curved edge that avoids the node. 



There are a number of extensions that come to 
mind which we did not yet have a chance to experi- 
ment with sufficiently. A relatively straightforward 
extension concerns more richly structured graphs. 
If there are different edge types, some types may 
be interpreted as relating the adjacent nodes more 
closely than other types. In this case we would want 
nodes connected by such edges to be in a closer 
proximity. This could be enforced by giving dif- 
ferent edge types different weights and using the 
induced weighted topological distance as the pa- 
rameter of the decay function. 




Fig. 13. Mexican Hat 



The combination of the ISOM with additional layout constraints is worth 
further exploration. The basic idea is to supply a set of arithmetic constraints 
on the node positions against which updates must be verified. An update causing 
inconsitency must be rejected. As an alternative strategy the next best winner 
could be chosen. Instead of just checking consistency it may also be reasonable 
to accept — with a certain percentage of exceptions — only updates which reduce 
the degree of constraint violation. A constraint-enhanced ISOM would make it 
possible to explicitly avoid node clustering. More importantly, it would offer the 
possibility to take problem-specific layout constraints into regard. A layout of 
rooted trees, for example, should be fairly straightforward. It is clear, however, 
that this extension is computationally costly. 



From a theoretical perspective a statistical analysis of the network behaviour 
is desirable. So far we have found suitable values for the layout parameters by 
experimentation. It would be a great improvement if we could develop a statis- 
tically justified heuristics for estimating suitable parameter values. For this the 
notion of the network’s energy state would be useful. Though it is a customary 
and fruitful method to look at neural network learning methods from the point 
of view of energy minimization, this is rarely done for Kohonen networks. How- 
ever, some steps towards this are reported in E21[T2|. Such a notion could be 
the key to a thorough theoretical analysis of the ISOM and might even reveal 
formal connections to force-directed layout. 
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7 Conclusions 

The paper has introduced self-organizing graphs and the ISOM graph layout 
method which is based upon an extension of the competitive learning algorithm 
used in the self-organization of Kohonen networks. To our knowledge this is 
the first connectionist approach to automatic layout. We have presented an ex- 
perimental evaluation and extensions of the basic model to 3D-layout and to 
generalized layout spaces such as spherical surfaces. 

The advantages of the ISOM layout method, which has an extremely simple 
implementation, are its adaptability to different types, shapes, or dimensions of 
layout areas and even to different metric spaces. On top of this it consumes only 
little computational resources which renders it comparatively fast. 

Since graph layout is a computationally hard optimization problem, it is 
interesting to note that many good solutions for such problems, such as simulated 
annealing, force-directed models, genetic algorithms, neural networks, and more 
recently ant colonies |Zj are inspired by natural metaphors. So is the ISOM. 

We are hoping that in the future we will be able to support our intuitive 
understanding of the ISOM’s function by a more formal analysis based on the 
notion of an energy state of the network. 
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Abstract. Let G be a planar graph of n vertices, vi, . . . ,Vn, and let 
{pi , . . . ,pn} be a set of n points in the plane. We present an algorithm 
for constructing in O(n^) time a planar embedding of G, where vertex 
Vi is represented by point pi and each edge is represented by a polygonal 
curve with 0{n) bends (internal vertices.) This bound is asymptotically 
optimal in the worst case. In fact, if G is a planar graph containing at 
least m pairwise independent edges and the vertices of G are randomly 
assigned to points in convex position, then, almost surely, every planar 
embedding of G mapping vertices to their assigned points and edges to 
polygonal curves has at least m/20 edges represented by curves with at 
least m/40® bends. 



1 Introduction 

A planar embedding of a planar graph G is a drawing of G in the plane, where 
the edges are represented by simple continuous curves which intersect only at 
endpoints representing common vertices. By definition, every planar graph has a 
planar embedding. In 1948, Fary showed that every planar graph has a straight- 
line embedding, i.e., a planar embedding where every edge is represented by 
a single line segment. Numerous algorithms exist for constructing straight-line 
embeddings. These algorithms construct and report a set of locations for the 
vertices of G. The line segments representing the edges of G are implicitly given 
by the locations of their endpoints. 

In this paper, we consider the problem of constructing a planar embedding 
where vertices are mapped to prespecified fixed locations. If {vi, . . . ,Vn} is the 
vertex set of G and {pi, . . . ,pn} is the set of vertex locations, then each Vi must 
be mapped to the point pi. In most cases, the edges of G can no longer be 
represented by disjoint line segments. However, they can be represented by non- 
crossing polygonal curves. What is the complexity of these polygonal curves, i.e., 
how many bends (internal vertices) must they have? By a slight modification of 
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the algorithm presented in Souvaine and Wenger jSj, we obtain that if G is a 
single path, then it has a planar embedding with a total of O(n^) bends. Here 
we prove a stronger result, which answers a question asked by Richard Pollack. 

Theorem 1. Every planar graph on n vertices admits a planar embedding which 
maps each vertex to an arbitrarily prespecified distinct location and each edge to 
a polygonal curve with 0(n) bends. 

Moreover, such an embedding can be constructed in 0{n^) time. 

We apply an idea of Pach, Shahrokhi, and Szegedy |S] to show that the 
bound in Theorem 1 is tight. We say that two edges of a graph are independent, 
if they do not share an endpoint. A set of pairwise independent edges is often 
called a matching. A set of n/2 pairwise independent edges in a graph with n 
vertices is called a perfect matching. 

Theorem 2. Let G be a planar graph of n vertices, v\, . . . ,Vn, which contains at 
least m pairwise independent edges and let {pi, . . . ,p„) be a random permutation 
of the vertices of a convex n-gon. 

Then, as n tends to infinity, in every planar embedding of G which maps Vi 
to Pi and the edges to polygonal curves, there are almost surely at least mf 20 
edges represented by curves with at least to/40^ bends. 

Note that a path of length n or a perfect matching of n vertices have \ji/2\ 
pairwise independent edges. 

The nature of the problem drastically changes if we only require that {ui, . . . , 
Vn} be mapped to a set {pi , . . . ,p„} of points in general position but we do not 
insist on the particular order. It is known that in this case there always exists 
a straight-line embedding which maps Vi to PT^(i), 1 < * < u, for a suitable 
permutation tt (see 0. E) 

2 Embedding Algorithm — Proof of Theorem 1 

Let G be a planar graph with vertex set V = {ui, . . . , u„}, and let {p\, . . . ,pn} 
be a set of n points in the plane. In this section, we give an algorithm for 
constructing a planar embedding of G such that vertex Vi is represented by 
point Pi and each edge of G is represented by a polygonal curve with 0(n) 
bends (internal vertices.) Our algorithm runs in O(n^) time. 

A Hamiltonian cycle is a cycle which visits each vertex of the graph exactly 
once. A planar embedding of a cycle divides the plane into a bounded and an 
unbounded component. If the edges are represented by polygonal curves, then 
the bounded component is a simple polygon. 

The general outline of our algorithm is as follows. We first bound the number 
of bends in a polygonal curve which follows the boundary of a tree at constant 
distance under the h metric (Lemma [Q). We next assume that G contains a 
Hamiltonian cycle and show how to construct a planar embedding of G mapping 
each vertex Vi to point pi (Lemma EJ. Finally, we show how to add vertices and 
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edges to a planar graph forming a new planar graph which has a Hamiltonian 
cycle (Lemma 0. Combining the last two results gives a proof of Theorem 1. 

To construct the planar embedding of a graph containing a Hamiltonian cycle, 
we construct a polygonal, planar embedding of a tree whose leaves are the points 
Pi, . . . Curves representing the edges of the cycle will follow the boundary 
of this tree or one of its subtrees at a fixed distance. We use the li metric to 
measure this distance. The distance between points (xi,yi) and (x 2 ,y 2 ) in the 
li metric is |a;i — X 2 I + |yi ~ 2 / 2 |- Let be the ball of radius e in the h metric. 
The Minkowski sum of a point set S and B^, written S + B^, is the set of all 
points which are at distance less than or equal to e from some point in S. 

Lemma 1. Let T be a straight-line embedding of a tree with N > 1 vertices. If 
e > 0 is smaller than the li-distance between any line segment (edge) e and any 
vertex of T not incident to e, then T + B ^/2 is a simple polygon whose boundary 
contains at most 4N — 2 vertices. 

Proof. Each vertex of the polygon T + B^f 2 is either a vertex of u + B ^/2 for 
some vertex u of T or lies on the intersection of the boundary of e + B ^/2 and 
e' + B ^/2 for two adjacent edges e, e' of T. The adjacent edges e and e' share 
some common endpoint u. Thus each vertex of the polygon T + B ^/2 c^^n be 
associated with a vertex u of T. The number of vertices associated with u is at 
most 2 + where > 1 is the degree of u. A tree has — 1 edges so the sum 
of the degrees of all tree vertices is 2N — 2. Thus summing 2 + d„ over all vertices 
of the tree gives a bound of 4iV — 2 on the number of vertices of T + B^/ 2 . □ 

An embedding of a graph is outerplanar if it is a simple closed curve with 
some non-crossing internal diagonals. The unbounded face of such an embedding 
corresponds to the (uniquely determined) Hamiltonian cycle of the graph. A 
graph is outerplanar if it has an outerplanar embedding. Mapping the vertices 
of an outerplanar graph to vertices of a convex n-gon in the order (clockwise 
or counter-clockwise) that they appear in the Hamiltonian cycle and its edges 
to line segments between n-gon vertices gives a straight-line planar embedding 
of the outerplanar graph. Every planar graph containing a Hamiltonian cycle 
can be divided into two outerplanar graphs which have only the edges of the 
Hamiltonian cycle in common. 

We are ready to construct the planar embedding of a graph containing a 
Hamiltonian cycle. 

Lemma 2. Let G be a planar graph ofn vertices, v\, . . . ,Vn, containing a Hamil- 
tonian cycle C and let {pi, . . . ,pn} be a set ofn distinct points in the plane. 
Graph G has a planar embedding such that 

(i) every vertex Vi of G is represented by the point pi; 

(ii) every edge of G is represented by a polygonal curve with at most 8n -I- 9 
bends. 

Moreover, such an embedding can be constructed in 0{n^) time. 
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Fig. 1. Tree T and the simple polygonal curve T through pi, ... 



Proof. By relabeling vertices and points, we may assume that the vertices of G 
appear in the Hamiltonian cycle C in the order (ui, . . . ,w„). We first construct 
an embedding of the edges of C. 

Let p* be some point in the plane to the right of p\ and in general position 
with respect to all the points pi, i.e., the line through any two distinct points, 
Pi and pj , does not contain p* . In addition, p* should not share an x-coordinate 
with any of the points pi. Let Ti be the line segment (p*,pi) and Ti be the union 
of Ti-i and (p*,pi). Set T equal to T„. Note that the leaves of Ti are pi, . . . ,Pi 
and that Ti C T 2 C ■ ■ ■ C Tn = T. The edges of the cycle will be routed around 
the embedded trees Ti. 

Let e > 0 be the smallest distance under the l\ metric between any pi and 
any line segment {p* ,pj), where j yf i. Let 5 equal e/(4n). Draw a simple closed 
polygonal curve T through the points pi, . . . in sequential order as follows. 
Start by drawing a vertical line segment from pi to a point at distance 6 above 
Pi. From this point, draw a polygonal curve clockwise around T 2 following the 
boundary of T 2 at a distance of 6 in the h metric. Stop when this polygonal 
curve crosses the vertical line through p 2 and draw a vertical line segment of 
length 3i5 through p 2 - The endpoint of the curve is now a distance 26 from p 2 . 

For z = 2 to n — 1, draw a polygonal curve following the boundary of Ti+i 
at distance of i6 in the li metric. Stop when this the curve crosses the vertical 
line through Pi+i and connect it to a vertical line segment of length (2z + l)i5 
through Pi+i. There are two possible such curves, one clockwise and one counter- 
clockwise around Tz+i. Choose the one which does not cover p\. Note that at a 
distance of i5 from Ti+i, the curve will not intersect any of the previously drawn 
curves. 

Complete T by drawing a polygonal curve clockwise around T at a distance 
of n5, and finally connecting the curve to p\ by a vertical line segment below 
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P\. (See FigureD) The simple closed polygonal curve F will be the image of the 
Hamiltonian cycle C in our embedding. Note that p* lies in the bounded region 
defined by F and that the points p\, . . . ,pn lie in clockwise order around F . 

Except for the two line segments on either end, the curve from pi to Pi+i is 
a subset of Ti + Bis- Tree is a subtree of T which has at most n + 1 vertices. 
By Lemmad the curve from pi to pi+i has at most (4(n + 1) — 2) + 2 = 4(n + 1) 
bends. Similarly, the curve from pn to pi has at most 4(n + 1) bends. 

We now construct pairwise disjoint auxiliary paths between the points pi 
and pi- For positive integers i < n and k < n — 1, let Ai^k be the boundary 
of Ti + B(^i_i')S+kS/n- The polygonal curve Ai^k intersects F at only two places, 
once on a line segment incident with pi and once on a line segment incident with 
Pi- (See Figure El) Thus, F divides Ai^k into two polygonal curves, one in the 
bounded region and one in the unbounded region defined by F. Truncate these 
curves at a distance S from their endpoints and connect their new endpoints to 
Pi and Pi by line segments, thus forming two paths between pi and pi. One of 
these paths lies in the interior of the bounded region and one in the interior 
of the unbounded region defined by F. Label these paths A'- ^ and in the 
bounded and unbounded regions, respectively. These paths are pairwise disjoint 
except perhaps at their endpoints. Applying Lemma D shows that these paths 
have at most 4(n + 1) bends. 





Fig. 2. The polygonal curve A^^k and paths Ag j. and Ag j,. 



We are finally ready to embed the remaining edges of G. Divide G into two 
outerplanar graphs G' and G" which have only the edges of Hamiltonian cycle 
C in common. Let E' and E" be the edges of G' and G", respectively, which are 
not in C. 

Draw a circle of radius 5 /2 in the Euclidean norm around p\ and intersect 
it with each of the A' and A''^,. Since the paths from pi to p 2 and Pn start 
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with vertical line segments above and below pi, the curve F divides F into two 
semi-circles, F' and F” in the bounded and unbounded regions defined by F. Let 
g- j. and q”f. be the intersection points of A'- f. and with F, respectively. 

For integers i,j, let f(i,j) be the unique non-negative integer less than n 
which is congruent to {i — j) mod n. We represent an edge {vi,Vj) G E' by 
a path from pi along to q'ij(ij), then a line segment from to 

g' /(j i) 1 finally a path along A'^ from g' to pj. This path contains 
at most 4(n + 1) + 4(n -I- 1) -I- 1 = 8n -I- 9 bends. (See Figure Elfor an example of 
a complete embedding of a graph.) 

We claim that this embedding of G' is planar. By construction, the paths vl' 
intersect or touch F only at their endpoints. Thus the only possible violations 
of planarity occur in the line segments that we drew in F' . 

Since G' is outerplanar, mapping its vertices to vertices of a convex n-gon 
and its edges to a line segments between vertices gives a planar embedding of 
G' . The points on <P' form vertices of a convex polygon and lie in the clockwise 
order 

Pi: ^2,1: ■ ■ • : ^2,71—1: ^3,1: ' ' ' : ^3,n— 1: ■ ' ’ : : ■ • ■ : Qn,n—1 ‘ 

For each edge (v^,Vj) of E\ we drew the line segment (?i,/(iy): 'Jy/Q,*)) 
in F' . If instead we drew the line segment (g']^,g'^) in the convex n-gon 
(pi, g( g 2 1 , . . . , g(, i) for each (uj, Vj) G if', we would have a planar embedding 
of G' . If (vi,Vj) and {vi>,Vj') are two edges in E' and Vi,Vj,Vi> and Vji are dis- 
tinct, then ™ around 

<P' as g'_i, g' 1 , q'^, -^, g', ^ Since line segments (g'_i,g'_i) and (g',_i, g',_i) are pair- 
wise disjoint, so are line segments 9y/(j,i)) and (?(/ j/): 

If (vi,Vj) and (vi,Vj') are two edges in E' sharing the vertex u,, then the rel- 
ative positions on F' of and ensure that (9( and 

pairwise disjoint. 

An edge (vi,Vj) G E" is represented by a path from pi along to 

followed by a line segment from q”ji^ij) to and finally a path 

from q” fQ q to pj. This path also contains at most 4(n-|-l)-|-4(n-|-l)-|-l = 8n-|-9 
line bends. A similar argument to the one for E' shows that this embedding of 
E" is planar. The only difference is that the points lie in counterclockwise order 
around F" . 

Finding a point p* in general position and calculating the value of e can be 
done in 0(n log n) time. Constructing the polygonal curves from pi-i to pi and 
from pn to Pi takes 0{n) time per curve. Constructing the polygonal curves 
representing edges in E' and E” also take 0{n) time per edge for a total of 
O(n^) time. □ 

We now show how to turn a planar graph G into a planar graph FI containing 
a Hamiltonian cycle. 

Lemma 3. Let G be a planar graph of n vertices. By subdividing edges of G 
by at most two new vertices and adding some edges between vertices, we can 
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construct from G a planar graph H which has at most bn — 10 edges and contains 
a Hamiltonian cycle. This construction can be accomplished in linear time. 

Proof. If G has fc > 1 connected components, add k — 1 new edges to form a 
connected graph G'. Construct a planar embedding of G', not necessarily with 
straight-line edges. Let S' be a spanning tree of G'. Clearly, all edges of G' — G 
belong to S. (See Figure El) 




Fig. 4. Graph G and a spanning tree S. 



Starting at any vertex, walk clockwise around S, visiting its vertices in order. 
Note that the internal vertices of S will be visited more than once. Label the 
vertices with wi,W 2 ,. ■ . ,w„, by the order, in which they are first visited. If Wi 
and Wi+i are connected by an edge, then let this edge belong to the Hamiltonian 
cycle (1 < J < n, and we use the convention Wn+i ■= w\). If not, connect Wi 
to Wi+i by a simple curve clockwise around the boundary of S, passing very 
close to it. Wherever this curve intersects an edge of G', introduce a new vertex. 
Thus, this curve becomes a path whose pieces are added as edges to the graph 
and to its Hamiltonian cycle. Merge any multiple edges, and call the resulting 
graph H. (See Figure 0) 





Fig. 5. Constructing the Hamiltonian cycle and graph H. 



Each edge of G' was split at most twice. Since G' has at most 3n— 6 edges and 
the n— 1 edges of S were never split, H has at most n-|-2(2n— 5) = 5n— 10 vertices. 
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Connecting G to form G' , constructing a planar embedding and a spanning tree 
S of G' take 0{n) time. Walking around S to construct the Hamiltonian cycle 
can also be done in linear time. □ 

Proof of Theorem 1: Starting with a planar graph G of n vertices, form a planar 
graph H on at most 5n — 10 vertices containing a Hamiltonian cycle as outlined 
in Lemma 0 Only the n original vertices from G have prespecified locations, but 
we can arbitrarily assign locations to the new vertices in H. Applying Lemma 0 
gives a planar embedding of H, whose edges are polygonal curves with at most 
8(5n— 10)+9 < 40n bends. We construct an embedding of G from the embedding 
of H by merging polygonal curves which correspond to portions of edges of G 
and by deleting polygonal curves which are not part of any edge of G. Each edge 
of G was split into at most 3 edges in H, so each edge in G can be represented 
by a polygonal curve with at most 3 * 40n = 120n bends. 

Constructing H takes 0(n) time. Embedding H in the plane takes 0{nf) 
time. Constructing the embedding of G from the embedding of H takes time 
proportional to the size of the embedding of H which is 0{nf). Thus, our algo- 
rithm runs in 0{nf) time. □ 

A more generalized version of Lemma 0 is also true. Given a Hamiltonian, 
planar graph H' of n vertices where only fc > 1 vertices of H' have preassigned 
point locations, it is possible to represent every edge of H' by a polygonal curve 
with at most 8fc-|- 9 bends. Applying this bound to H in the proof of Theorem 1 
gives a planar embedding of H with at most 8n -|- 9 < 8(n -|- 2) bends per curve. 
This results in each edge of the original graph G being represented by curves 
with at most 3 * 8(n + 2) = 24(n -|- 2) bends. 

3 Lower Bound — Proof of Theorem 2 

Let iJ be a graph with vertex set V{H) and edge set E{H). The bisection 
width b{H) of H is the minimum number of edges running between V\ and 
V 2 , over all partitions of the vertex set V{H) into two parts Vi U V 2 such that 
|ki|,|V 2 | > |V(iL)|/3. Roughly speaking, the bisection width is the minimum 
number of edges whose deletion splits the graph into two approximately equal 
parts. 

The crossing number c{H) of H is the minimum number of crossing pairs of 
arcs in any planar representation of H, where the vertices are mapped to distinct 
points and the edges are represented by simple continuous arcs connecting the 
corresponding points and not passing through the image of any other vertex. 

Leighton 0 discovered that the above parameters are closely related. A 
somewhat more general form of his result was established in 0. 

Lemma 4. Let H be a graph of n vertices, whose degrees are d\, . . . ,dn- Then 

n 

16c(G) + ^rf2 
2=1 




62(G) < (1.58)2 
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The following lemma bounds the bisection width of a random graph which 
is constructed from the union of a cycle and a random perfect matching. 

Lemma 5. Let Hn be a randomly defined graph with vertex set V{Hn) = 
{ui, . . . , Un}, whose edge set is the union of {uiU 2 , U 2 M 3 , . . . , UnUi} and a ran- 
dom perfeet matehing of the vertices (n is even). 

Then, as n tends to infinity, the bisection width of Hn is almost surely at 
least nf 20. 

Proof. The number of different graphs that can be obtained as Hn is equal to 
the number of perfect matchings on n vertices, ^n/ 2 )\ 2 '^/‘^ ’ these graphs are 

equally likely. 

Next we estimate that at most how many of these graphs have bisection 
width at most k, for some fixed k < n/3. Consider the cycle C = u\...Un C Hn. 
The number of partitions V{Hn) = Vi U C 2 , for which at most k edges of C run 
between Vi and V 2 , is at most 

These edges cut C into at most k intervals, belonging alternately to Vi and to 

fo 2 . 

For a fixed partition with 2n/3 > |Vi| = rii > IV 2 I = n 2 > n/S, the number 
of perfect matchings of V{H) with at most k edges running between Vi and V 2 
is at most 

/nfo /n 2 V., (ni-f)! {u 2 - i)l 

> + /n/3\,, (2n/3-fc)! (n/3-fc)! 

^ ^ k M k p/3-fc^igIlZ^- 

Indeed, one can choose i points from Vi and i points from V 2 in ("f ) different 
ways, and match them in i\ ways. The number of matchings on the remaining 

Uj — i points of Vj is at most — ; j = 1,2. Thus, the probability that 

(^)! 2 ^ 

b{Hn) < fc is at most 

/2n/3\ /n/3V | (2n/3-fc)! (n/3-fc)! / nl 

WV k )\k r-( 2 n/ 3 -fc )! 2 giiZg^ p/3-fc ^io:V^/ (n/ 2 )! 2 "/ 2 ’ 

which tends to zero when k = [n/ 20 j and n — > 00 . □ 

Proof of Theorem^ : Let P = {pi, . . . ,Pn} be the set of vertices of a convex n- 
gon in the plane, listed in clockwise order. Let G be a planar graph on the vertex 
set C(G) = {ui, . . . , u„}, and let / : fo(G) ^ P be a randomly chosen bijection. 
Suppose that G has m pairwise independent edges, V 1 V 2 , . ■ . ,V 2 m-iV 2 m- 
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Let H 2 m be a graph on the vertex set V {H 2 m) = f{v 2 m)}, con- 

structed as follows. If /(fii), • ■ • , f{vi 2 m) is the list the elements of V{H 2 m) in 
clockwise order around P, then let 



E{H 2 m) = I 1 < j < 2m} U {f {v 2 i-i) f {v 2 i) I 1 < i < mj, 

where i 2 m+i '■= *i- Clearly, H 2 m is isomorphic to the graph described in 
Lemma 5. In particular, almost surely we have b{H 2 m) P m/10. 

Suppose now, in order to obtain a contradiction, that G has a planar em- 
bedding which maps Vi to f{vi), 1 < * < n, and every edge is represented by a 
polygonal curve such that at most ml2Q edges are represented by curves with 
at least m/40^ bends. If, for some 1 < j < m, V 2 i-iV 2 i is represented by a curve 
with at least m/40^ bends, then remove f {v 2 i-i) f (v 2 i) from the graph H 2 m- 
The bisection width of the resulting graph almost surely satisfies 

> b{H 2 m) ~ m/20 > m/10 - m/20 = m/20. 



Hence, applying Lemma 4 with di < 3 (1 < z < m), we obtain that almost surely 



c(ddL) > 



16 V (1.58)2 



— 9m I > 



(m/20)2 

40 



— m = 



40 X 202 



On the other hand, in the above planar embedding of G, all paths representing 
the edges of H'^ have fewer than m/40^ bends. Adding the edges of the convex 
hull of ),•■■, /(fiam) to the collection of these paths, we obtain a planar 
representation of with at most m x (m/40^ -|- 1) x 2 < 2vn? /AQ^ + 2m 
crossings, because each line segment in a polygonal path can cross at most two 
edges of the convex hull of P. Thus, 



40 X 202 



- m < c{H^) < 



402 X 20 



2m, 



a contradiction for suitably large m. 



□ 



4 Remarks 

This paper discusses the worst case complexity of constructing a polygonal pla- 
nar embedding of a graph G of n vertices, vi,. . .,Vn, where each vertex Vi is 
mapped to a prespecified point pi. The corresponding optimization problem is 
the following: given a planar graph G with vertex set {z)i, and a point 

set {pi, . . . ,Pn}, construct an embedding of G which maps Vi to pi, 1 < i < n, 
using as few bends (in total) as possible. Bastert and Fekete PJ proved that this 
problem is NP-hard. Is there an approximation algorithm for this problem which 
gives a solution within a factor of the optimal one? 

Bastert and Fekete actually proved that minimizing the total number of 
bends is NP-hard in the case when G is a perfect matching. However, the com- 
plexity of the problem is unknown if G is a simple path, or, more generally. 
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any connected graph. The complexity is also unknown if the points pi, . . . ,pn 
are required to be in convex position (vertices of a convex polygon.) Are these 
restrictions of the problem also NP-hard? 

The I7(n^) worst case lower bound in Theorem |2| assumes that the points 
Pi, . . . ,Pn are in convex position. What if the points are not in convex position, 
say, they form a ^/n x ^yn grid? Are there examples of assignments of vertices of 
a planar graph G to the vertices of such a grid such that any polygonal planar 
embedding of G requires a total of J7(n^) bends? 

References 

1. Bastert, O., and Fekete, S. Geometrische Verdrahtungsprobleme. Technical 
Report 96.247, Angewandte Mathematik und Informatik, Universitat zu Koln, Koln, 
Germany, 1996. 

2. Gritzmann, P., Mohar, B., Pack, J., and Pollack, R. Embedding a planar 
triangnlation with vertices at specified points. American Mathematical Monthly 98 
(1991), 165-166. (Solution to problem E3341). 

3. Leighton, F. Complexity Issues in VLSI. MIT Press, Cambridge, 1983. 

4. Pach, J., and Agarwal, P. Combinatorial Geometry. John Wiley and Sons, New 
York, 1995. 

5. Pach, J., Shahrokhi, F., and Szegedy, M. Applications of the crossing number. 
Algorithmica 16 (1996), 111-117. (Special Issue on Graph Drawing, edited by G. 
Di Battista and R. Tamassia). 

6. Souvaine, D., and Wenger, R. Constructing piecewise linear homeomorphisms. 
Technical Report 94-52, DIMACS, New Brunswick, New Jersey, 1994. 



Proximity Drawings: 

Three Dimensions Are Better than Two* 

(Extended Abstract) 



Paolo Penna^ and Paola Vocca^ 

^ Dipartimento di Scienze dell’Informazione, Universita di Roma “La Sapienza”, 
Via Salaria 113, 1-00198 Rome, Italy. 
pennaSmat .uniroma2 . it, 

^ Dipartimento di Matematica, Universita di Roma “Tor Vergata”, 

Via della Ricerca Scientifica, 1-00133 Rome, Italy. 
vocca@axp.mat .uniroma2 . it 



Abstract. We consider weak Gabriel drawings of unbounded degree 
trees in the three-dimensional space. We assume a minimum distance 
between any two vertices. Under the same assumption, there exists an 
exponential area lower bound for general graphs. Moreover, all previously 
known algorithms to construct (weak) proximity drawings of trees, gener- 
ally produce exponential area layouts, even when we restrict ourselves to 
binary trees. In this paper we describe a linear-time polynomial-volume 
algorithm that constructs a strictly-upward weak Gabriel drawing of any 
rooted tree with 0(logn)-bit requirement. As a special case we describe 
a Gabriel drawing algorithm for binary trees which produces integer co- 
ordinates and -area representations . Finally, we show that an infinite 
class of graphs requiring exponential area, admits linear-volume Gabriel 
drawings. The latter result can also be extended to /3-drawings, for any 
1 < /3 < 2, and relative neighborhood drawings. 



1 Introduction. 

Three-dimensional drawings of graphs have received increasing attention re- 
cently due to the availability of low-cost workstations and of applications that 
require three-dimensional representations of graphs 0 E31 d E2 Eg. Even 
though there are several theoretical results 011100 , there is still the need for 
a better theoretical understanding of three-dimensional capabilities. 

In this paper we tackle the problem of drawing proximity drawings in the 
three-dimensional space. Proximity drawings have been deeply investigated in 
the two-dimensional space because of their interesting graphical features (see, 
e.g. |T7iEioiTnirrniiiiTU]). Nevertheless, only preliminary results are available 
for three-dimensional proximity drawings ng. 

* Work partially supported by the Italian Project MURST- “Algorithms and Data 
Structure” 
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In the three-dimensional space, a proximity drawing is a straight-line drawing 
where two vertices are adjacent if and only if they are neighbors according to 
some definition of neighborhood. One way of defining a neighborhood constraint 
between a pair of vertices is to use a proximity region, that is a suitable region 
of the plane having the two points on the boundary. Two vertices are adjacent if 
and only if the corresponding proximity region is empty, i.e., it does not contain 
any other vertex of the drawing (however, an edge of the drawing may cross 
the proximity region). For example, two vertices u and v are considered to be 
neighbors if and only if the closed disk having u and v as antipodal points, is 
empty. Proximity drawings that obey this neighborhood constraint are known in 
the literature as Gabriel drawings (iniira]) and the closed disk is called Gabriel 
disk. In a relative neighborhood drawing |23j two vertices u and v are adjacent 
if there is no other point whose distance is strictly less than the Euclidean 
distance between u and v. A generalization of Gabriel and relative neighborhood 
drawings is represented by (3-drawings, where the proximity region is defined by 
the parameter (3. /3-drawings were first introduced by Kirkpatrick and Radke 
in the computational morphology context. 

In the three-dimensional space the definition of proximity drawings is an 
obvious extension of that for the two dimensional case. In particular, we refer to 
Gabriel proximity region as a Gabriel sphere instead of disk. 

In |3j, the weak proximity drawings have been first introduced. A weak proxim- 
ity drawing is a straight-line drawing such that for any edge (it, v) the proximity 
region of u and v is empty. This definition relaxes the requirement of classical 
/3-drawings, allowing the /3-region of non-adjacent vertices to be empty. 

As we will show in this paper, this weaker definition of proximity yields more 
efficient drawing algorithms while preserving the graphical features of strong 
proximity drawings (e.g. edges are represented as straight lines; vertices not 
incident to a certain edge are drawn far apart from that edge). 

Unfortunately, all known algorithms that compute (weak) proximity draw- 
ings produce representations whose area increases exponentially with the number 
of vertices. As a consequence, the problem of constructing proximity drawings 
of graphs that have small area is considered a very challenging one by several 
authors (see j3iini[Ei). Additionally, in m an exponential lower bound on the 
area of Gabriel drawings has been presented. 

In this paper we shown that the use of the third dimension can substantially 
help in improving the efficiency of the proximity drawings. More precisely, the 
results we achieve in this paper are listed below: 

— We describe a linear time polynomial volume algorithm for strictly upward 
weak Gabriel drawing of unbounded degree trees, where the coordinates of 
vertices can be represented with 0(log n)-bits; 

— We give a n^-area algorithm for strictly upward weak Gabriel drawing of 
binary trees where all vertices have integer coordinates; 

— We present an infinite class of graphs such that any Gabriel drawing (both 
strong and weak) of a graph in the class requires area exponential in the 
number of vertices, while admits linear volume strong Gabriel drawing; 
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— We extend the above result to /3-drawings for 1 < /3 < 2 and to relative 
neighborhood drawings; 

In all algorithms we present we assume a minimum distance between vertices, 
which imposes that dimensions cannot be arbitrarily scaled down. 

2 Preliminaries. 

A three-dimensional layered drawing of a tree T is a drawing such that each 
vertex is placed on equally spaced layers, being a layer a plane orthogonal to 
the z-axis. In the following we denote with 6 the distance of any two consecutive 
layers and with layer i the plane given by the points whose z coordinate is equal 
to Si. Thus, a layered drawing is a drawing such that the z-coordinate of the 
vertices takes value in {i5, 2S, . . . ,iS, . . .}. The height^ the width and the depth of 
a layered drawing are defined as the height, width and depth of the smallest 
isothetic parallelepiped bounding the drawing. Given a vertex a we denote with 
La the layer on which the vertex is drawn. 

Given two points in the three-dimensional space, we denote with /3] 

the jd-region of influence of a and b. For 0 < /3 < 1, R[a,b,fl] is the intersection 
of the two closed spheres of radius d{a,b)/{2fl) passing through both a and b. 
For 1 < /3 < oo, i?[a, b, /3] is the intersection of the two closed spheres of radius 
/3d(a, b) and centered at the points (1 — /3/2)a -I- (/3/2)6 and (/3/2)a -I- (1 — Pl2,)b. 
A weak (3-drawing for a graph G is a drawing of G such that for each pair of 
adjacent vertices a and b, the proximity region R[a, 6, (3\ does not contain any 
other vertex of the drawing. If the proximity region of any two non adjacent 
vertices contains at least another vertex of the drawing then the drawing of G is 
a strong (3-drawing or simply (3-drawing. A (weak) Gabriel drawing is a (weak) 
/3-drawing with /3 = 1. In this case, the proximity region of any two points a 
and b is denoted by R[a, b] and corresponds to the closed sphere centered at the 
middle point between a and b whose radius is d(a,b). Proximity regions of a 
2-dimensional drawing are similarly defined as the intersection of closed disks. 

Finally, to simplify the notation, we denote a vertex and a point representing 
it with the same symbol. Additionally, let u, be a vertex. We denote by Xu, yu 
and z„ its x-, y-, and z-coordinates. 

3 The Algorithm. 

In this section we describe a linear-time n^-volume algorithm that constructs a 
strictly-upward weak Gabriel drawing of any rooted tree T with n nodes. The 
correctness of the algorithm will be proved in Sect. El 

The construction of the drawing is performed in two phases. In Phase 1 we 
construct an upward straight-line layered drawing of T in the yz plane. This will 
be the “front” of a three-dimensional drawing. Indeed, in Phase 2, we assign 
different x-coordinates to the vertices, so that the children of a vertex are at the 
same distance from the parent. This will be performed by simply moving the 
subdrawings along to the x-direction. 
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3.1 Phase 1: The Front Drawing. 

In the first step we construct an upward straight-line layered drawing of T on the 
j/z-plane (i.e. all the vertices have null x coordinate). The value of the distance 
5 of two consecutive layers will be specified in Sect.0 

We want our drawing to satisfy the following two invariants: 

1. Each edge connects vertices on consecutive layers. 

2. Each internal vertex is at the same distance from its leftmost and its right- 
most child. 



algorithm front_drawing(T) 
h <— height of T 
r <— root of T 

if h = 1 then 

draw r on layer 1 with null y-coordinate 

else begin 

Ti <— largest immediate subtree of T 

ri, . . . ,rk <— roots of Ti , . . . , children of r 

for i = 1 to A: do 

Ai =front_drawing(Ti) 
translate Zii so that ri is on layer h — 1 
for i = 2 to A: do 

translate At so that: 

1. ri is on layer h — 1, and 

2. Ai is at unit distance from Z\i_i 

draw r on layer h at the same distance from ri and rt 

connect r to ri , . . . , 

end 

end 



Fig. 1. Phase 1: Algorithm front_drawing. 



The algorithm in Fig. ^ constructs a drawing of tree T having as imme- 
diate subtrees Ti, . . . , Tfe, by first recursively drawing Ti, . . . , Tfe, and then by 
rearranging the subdrawings so to satisfy Invariants Q] and |2| (see Fig. 0a)). 

It is easy to see that the algorithm in Fig. [H computes in linear-time a lay- 
ered drawing on the j/z-plane that satisfies both the previous two invariants. 
Moreover, the width (respectively, the height) of the drawing is at most n (re- 
spectively, n^), where n is the number of nodes of the tree. 
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3.2 Phase 2: Equally Space the Children. 

Let u be an internal vertex of T and v\,...,Vk its children. In this phase we 
assign different ^-coordinates to vertices v\, . . . ,Vk so that all edges (u, Vi), with 
1 < z < fc, have the same length. Let D{u) be the disk on the layer containing 
vi, . . . ,Vk and having as antipodal points vi and Vk (see Fig. 01(b)). We translate 
V 2 , ■ ■ ■ ,Vk-i along the x-direction until they meet the boundary of D{u) (see 
Fig. 0(b)). 

Algorithm move(T) in Fig. |2 implements the above strategy in linear time. 



algorithm move(T) 
r <— root of T 

ri, ... ,rk <— roots of Ti, . . . ,Tk children of r 
d = d(n, rfc) 

for i = 2 to fc — 1 do begin 

Xrj = Xr + \J d?' 

end 

for i = 1 to fc do begin 

move(rr^ ) 

end 



Fig. 2. Phase 2: Algorithm move. 




(a) 



(b) 



Fig. 3. (a) Phase 1: the front drawing of T. (b) Phase 2: how to equally space 
the children from the parent. 
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4 Proof of Correctness. 

In this section we prove that the drawing obtained after Phase 2 is a weak Gabriel 
drawing for any tree T given in input. Moreover, its volume is n^, where n equals 
the number of nodes of T. To this aim we first need some technical lemmas. 

Lemma 1. Let the distance 5 of any two consecutive layers be at least n. For 
any two adjacent vertices a and b, R[a,b] intersects only layers La and L^. 

Proof. It is easy to see that the length of any edge is at most \/rif + nf + ri^ = 
\/3n. On the counterpart the distance from the center of i?[a, b] to any layer 
other than La or is 3nf2, which implies that such a layer does not intersect 
i?[a, b]. 

Hereafter, we assume 6 = n. 

Lemma 2. Let a, b and a, b be three points such that d{a, b) = d(b, c). Then, 
c ^ R[a, b] . 

For any two points a and b on the same layer L, we denote by D[a,b] the 
closed disk on L having a and b as antipodal points. 

Lemma 3. Let a and b be any two points and let b' be the projection of b on 
layer La- Then, 



R[a, b]n La = D[a, b']. 

From now on, we identify a vertex r with the point of the space that represents 
r in the three-dimensional drawing of Phase 2. Additionally, we denote with S(r) 
the semi-infinite strip of the space of minimal width containing all the vertices 
in the subtree rooted at r (see Fig.Efb)). 

Lemma 4. Let u be any vertex and let v\, . . . ,Vk be its children. Then the fol- 
lowing relations hold: (a) R[u,Vi] (1 Ly., C S{u). (b) R[u,Vi] D Ly C S{u). 

Proof. We distinguish the two parts of the lemma. 

(a) R[u, Vi] n c S{u). 

Let us consider the disk D{u) containing v\,. . . ,Vk (see Fig. 0). We apply 
Lemma 0 with a = Vi and b = u. If we denote with u' the projection of u on 
then we have: 

R[u, Vi] n Ly^ = D[u', Vi] C D{u) C S{u). 

(b) R[u,Vi\ n Ly C S{u). 

Let D'(u) be the disk D{u) translated on and centered at u. Also, let w' 
be the projection of Vi on By applying Lemma 0 with a = u and b = Vi 
we obtain: 



R[u,Vi] n Ly = D[u,v'i] C D'{u) C S{u). 



The lemma thus follows. 
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We are now in a position to prove that the algorithm described in the previous 
section correctly constructs a weak Gabriel drawing for any tree T. 

Lemma 5. The drawing obtained after Phase 2 is a weak Gabriel drawing. 

Proof. We have to prove that, for any edge (u,Vi), the proximity region R[u,Vi] 
does not contain any other vertex. Suppose, by contradiction, that a vertex v* 
falls within R[u, Vi\. Then, from LemmaDl only one of the following two cases is 
possible: 

V* G L„. {v* is on the same layer ofvi,... ,Vk) 

From Lemma 0 v* must fall within S{u). But the only vertices in S{u) nL„. 
are the children of u. Thus, d{u,v*) = d{u,Vi). From Lemma 0 v* ^ Ly., 
thus a contradiction. 

V* G Lu (v* is on the same layer of u) 

It is easy to see that it must be S{v*) n S{u) = 0 (see Fig. From Lemma 
EJwe have that v* ^ R[u,Vi], thus a contradiction. 

Let us now consider the size of the drawing. 

Lemma 6. The volume of the drawing obtained after Phase 2 is at most . 

Proof. It is easy to see that the height is at most n^, and the width is less than 
or equal to n. We have to prove that also the depth depth{T) is at most n. The 
proof is by induction on n. 

Step base (n = 1) . Trivial. 

Inductive step. Let us suppose the lemma holds for trees with at most n — 1 
nodes, and let T be an n node tree. Let also Ti , . . . , be its immediate 
subtrees, whose number of vertices are ni, . . . , respectively. Let also sup- 
pose T\ be the larger immediate subtree. Then, from the algorithm move and 
considering that the initial width is at most n, we have that (see Fig. Elb)): 



depth{T) < ma.x {depth{Ti), n/2 -|- depth{T 2 ), . . . , nj2 + depth{Tk)} 

< max{ni, n/2 -|- n^} < n, 

where the second last inequality follows from the inductive hypothesis, and 
the last one comes from n^ < n/2, for 2 < i < k. 

By combining Lemma Eland Lemma 0 we get the following result. 

Theorem 1. Any tree admits a three-dimensional weak proximity drawing of 
volume at most n^. 

The drawing produced by algorithms front -drawing and move requires a 
real RAM to represent vertex coordinates. Nevertheless, it is possible to slightly 
modify them to obtain proximity drawings with 0(log n)-bit requirement with- 
out increasing the volume. 
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First, notice that, by construction, all vertices have integer z-coordinate. We 
can modify algorithm front-drawing so that also the y-coordinate is an integer, 
by rounding it to the nearest integer value. Finally, the ^-coordinate can be 
represented using the first 0(log n)-bits. It can be proved that the drawing so 
obtained is still a Gabriel drawing. Hence: 

Theorem 2. Any tree admits a three-dimensional weak proximity drawing of 
volume at most n'^ with 0{logn)-bit requirement. 

Finally, it is easy to see that if the input tree T is a binary tree then algorithm 
front_drawing, with 6 = n/2 -|- 1, produces a two-dimensional Gabriel drawing 
of T. Hence: 

Theorem 3. Any binary tree admits a two-dimensional weak proximity drawing 
with integer coordinates and at most area. 

5 Exponential Area versus Polynomial Volume. 

In this section we describe an infinite class of graphs such that any Gabriel draw- 
ing requires exponential area, while they can be drawn in the three-dimensional 
space in linear volume, instead. This gives evidence that the use of the third di- 
mension can substantially help in improving the efficiency and the effectiveness 
of the drawings. Additionally, we extend the results to /3-drawings for 1 < /3 < 2 
and to relative neighborhood drawings. 

The class has been introduced in m, and in m the authors proved an 
exponential-area lower bound. 



5.1 Class of Graphs. 

The class is inductively defined as follows. Graph Gi is the graph shown in 
Fig.E^a). The graph G^+i is obtained from Gi by adding five vertices 

and by connecting them to Gi as shown in Fig. m- Glearly, the num- 
ber of nodes of G„ is 5n-|- 1. We denote with Pi the pentagon of Gi given by the 
5-cycle v\vl 2 v\v\v\. Notice that each side of pentagon Pi form a triangle with a 
vertex of Pi+i, as well as each side of Pi+i with vertices in Pi. We refer such 
triangles as petals. 

The main result of uni is the following. 

Theorem 4 (|16I|I. A Gabriel drawing and a weak Gabriel drawing of graph 
Gn require area 17(3"), under any resolution rule assumption. 

In the same same paper, the authors generalized the previous result to /3- 
drawings, for any 1 < /3 < 
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5.2 Linear- Volume Drawings. 

In this section we describe a linear-time algorithm to construct a linear-volume 
strong Gabriel drawing of Gn- The correctness of the algorithm will be proved 
in the next section. 

We assume a eonstant distance S between any two layers. The value of 6 will 
be specified later. 

Consider the algorithm pentagons_in_3d of Fig. 0 All pentagons Pi are 
equally drawn on different consecutive layers as regular pentagons and then 
rotated by a 7 t/ 5 angle. Since the distance of consecutive layers is constant, and 
Pi is drawn in constant area, the volume is 0(n). Fig.EKc) shows a drawing of 
Gi. 

We will see in the following that, with a suitable choice of the value of 5, the 
drawing we obtain is a strong Gabriel drawing. 



algorithm pentagons_inJ3d(Gn) 

draw Gi on layer 1 such that Pi is a regular pentagon centered at «o 
for i = 2 to n do begin 

draw Pi on layer i as Pi_i rotated by tt/ 5 
connect Pi with Pi-i 

end 



Fig. 4. The algorithm to draw in linear volume. 





Fig. 5. The exponential-area/linear- volume class: (a) Graph Gi. (b) Graph Gi+i 
given Gj. 
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Fig. 6. (a) Two consecutive pentagons viewed from top. (b) How to draw a single 
petal, (c) The whole three dimensional drawing. 



5.3 Proof of Correctness. 

In this section we prove that the drawing obtained by algorithm pentagons J.n_3d 
is a strong Gabriel drawing of Gn- To this aim, we have to prove that: (a) The 
proximity region of any two adjacent vertices does not contain any other vertex; 
(b) The proximity region of any two non-adjacent vertices contains at least an- 
other vertex. First note that G\ is Gabriel drawable on layer 1. We distinguish 
the two cases: 

Adjacent vertices We have the following two cases: 

1 . The edge is a side of a pentagon. Simply observe that no vertex of the 
pentagon itself can fall within the proximity region. Let I be the length 
of the side of a pentagon. Set S at least equal to Z/2-1- 1. Then, no vertex 
of other pentagons falls within the region. Notice that the value of i5 is 
proportional to I, and does not depend on the number of nodes of G„. 

2. The edge connects two pentagons. In this case the edge belongs to a 
petal. Since all the petals are equally drawn, then the vertex closest to 
the region is the opposite vertex of the petal itself. It is easy to see that 
such a vertex is outside the region, if the petals are drawn as isosceles 
triangles, (see Fig.|^b)). 

Non-adjacent vertices We distinguish the following two cases: 

1. The two vertices are not on consecutive layers. There always exists a 
value of i5 > ^/2 -I- 1 so that, for each vertex in Pi and each vertex in 
Pi+ 2 , at least one vertex of Pi+i falls within the region of influence. 
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2. The two vertices are on consecutive layers. Without loss of generality, 
let us consider vertices v{ and and apply Lemma 0 with a = v\ 

and b — If we denote with b' the projection of 6 = on layer 

La, we have that S D[a,b'] (see Fig. 0a)). Thus, from Lemma 0 
we have that G R[a,b] = R[v\,vl^"^]. We can reasoning similarly in 
the other cases. 

From the above discussion we derive the following fact. 

Theorem 5. For any n, graph Gn admits a three-dimensional strong Gabriel 
drawing of volume 0{n). 

Slightly modifying algorithm pentagons_in_3d is possible to extend Theo- 
rem 0 to strong P proximity drawings for 1 < /3 < 2. In particular, it can be 
extended to produce relative neighborhood drawings of G„. More precisely, for 
1 < P < i_cos 27 r /5 algorithm is the same as in Fig.0 with a suitable choice 
of 6. To obtain a relative neighborhood drawing of G„ we need to draw Gi in 
a different way. Translate vertically vertex vq on level 0 leaving vertices of the 
pentagon Pi on level 1 as in algorithm pentagons_in_3d. It easy to verify that 
with a suitable choice of S the drawing so produced is a relative neighborhood 
drawing. Hence: 

Theorem 6. For any n, graph Gn admits a three-dimensional strong P-prox- 
imity drawing of volume 0{n), for any 1 < P < 2. 

6 Open Problems. 

Several problems concerning polynomial size proximity drawings are open. First 
of all it would be interesting to investigate possible extensions of our result along 
one or more of these directions: 

1. Extend P value. Consider /3-drawings of trees for 1 < /3 < 2. 

2. Extend the class. The first candidate might be outerplanar graphs. 

3. Strong proximity. In particular, do at least binary trees admit polynomial 
volume strong proximity drawings? 

4. Two-dimensional drawings. Can trees be drawn with polynomial area?. 

More generally, it would be interesting to consider the area requirement of other 
proximity drawings such as minimum spanning tree or relative neighborhood 
drawings. Finally, another important research direction is to characterize classes 
of graphs which admit proximity drawing. Notice that for trees this problem is 
solved for several definitions of proximity (including Gabriel) both in two and 
three dimensions 00C3IIII. 
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Abstract A graph is a tree of paths (cycles), if its vertex set can be par- 
titioned into clusters, such that each cluster induces a simple path (cycle), 
and the clusters form a tree. Our main result states that the problem 
whether or not a given graph is a tree of paths (cycles) is NP-complete. 
Moreover, if the length of the paths (cycles) is bounded by a constant, 
the problem is in P. 



1 Introduction 

Graph drawing defines one of its tasks as drawing graphs in a ’nice’ and ’un- 
derstandable’ way. But the meaning of the notions ’nice’ and ’understandable’ 
depends on the point of view; there is no universal definition of a ’good’ graph 
layout. 

One approach to produce expressive drawings are clustering techniques p| 
□ 115 . Clustering is a partitioning of the vertex set of a graph G into smaller 
sets called clusters, which satisfy certain criteria. Drawing graphs according to 
such a clustering appears in many application areas as VLSI design, software 
engineering, and knowledge representation. The main problem remains to find 
the corresponding clusters. Unfortunately in general this problem is NP-hard. 
However, if the clusters are known, ’nice’ and ’understandable’ graph drawings 
can often be constructed efficiently. 

In this paper we deal with certain graph classes called two-level clustered 
graphs introduced by Brandenburg 0. Given such a graph one is interested in 
drawings which make its structure transparent, especially if the graph is huge. 
Intuitively, given two graph classes X (level 1) and Y (level 2), a graph G is 
an A-graph of V-graphs {X of Y graph) if the vertices of G can be partitioned 
into clusters, such that (f) every cluster induces a graph of class V in G and 
(ii) the clusters form a graph of class X in G. The A-graph is obtained from 
G by shrinking first the clusters into single vertices and then the multiple edges 
into one edge. Examples are paths of paths, paths of cliques, trees of paths, trees 
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Figurel. Drawings of a tree of paths 



of cycles and so on. It is easy to see that every n x m-grid is a path of paths and 
every 2n x 2n-grid is a path of cycles. To the contrary every clique with more 
than four vertices is neither a path of paths nor a tree of paths. 

Two-level clustered graphs have been defined in a syntactic way by Kratochvil 
et. al. m They can be drawn according to their structure: On the top level draw 
the X-graph and on the second level the F-graphs (see j^l for more details). The 
obtained drawing reflects the nature of the graph and considerably simplifies 
its understanding. An example for a tree of paths is shown in Figure da) with 
thick lines indicating the paths. The drawings (b) and (c) are generated by the 
algorithms from and respectively. 

The main problem is to determine whether or not a graph is a two-level clus- 
tered graph. Complexity considerations concerning two-level clustered graphs 
are not known to be investigated in literature. To the contrary there is a great 
quantity of complexity results concerning for example partitioning of graphs, 
see GT11-GT16 in [^, or covering of graphs, see GT17 and GT18 in 0. This 
research is still in progress ^ 0. In fact almost all nontrivial partitioning and 
covering problems of graphs are NP-complete. Partitionings and coverings are 
two-level clusterings in which the first level graphs (A-graphs) are arbitrary. 
Hence, two-level clustering is a restrictive version of partitioning and covering. 
The restriction can lead to removing the NP-completeness. For example the 
problem whether or not a graph is a partition into cliques is NP-complete (see 
GT15 in jSj), whereas the problems whether or not a graph is a path of cliques 
or a large cycle of cliques are in P, as shown in |3]. Thus, it can not always be 
expected that every nontrivial two-level clustering problem is NP-complete. 

In this paper we consider trees of paths and trees of cycles. We show that 
given a graph G the question whether or not G is a tree of paths (cycles) is 
NP-complete. Moreover, if the length of the paths (cycles) is bounded, we give 
a polynomial algorithm based on the dynamic programming technique to solve 
these questions. Additionally, it also outputs such a tree clustering, if the answer 
is positive. A consequence of our results is that the problem whether or not a 
given graph is a tree of triangles is in P. This contrasts the fact that the problem 
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whether or not a given graph can be partitioned into triangles is NP-complete, 
see GTll in 0. 

2 Basic Notions 

In this section we review some basic notions on graphs and establish our notation. 
A graph with vertex set V and edge set E will be denoted by G = {V, E). The 
cardinality of the vertex set will be called the size of G. We deal with undirected 
graphs without loops and multiple edges. The subgraph of a graph G induced 
by a vertex set U is denoted by G\U]. A vertex set S' is a separator of G if graph 
G\V — S] is disconnected. 

A path of length n is a graph with vertex set V = {ui, . . . and edge 
set E = {('Ci, 'Ci+i)|l < i < n — 1}. Similarly, a cycle of length n is a graph 
with vertex set V = {v\, . . . ^Vn\ with n > 3, and edge set E = {(rii, Ui_|_i)| 
1 < i < n — 1} fi)}. For convenience, we also say that a subset P <ZV is 

a path (cycle) in G, if the subgraph G\P] is a path (cycle). Further a separator 
S is a path (cycle) separator if S is a path (cycle). 

Next we define two-level clustered graphs. In order to do this we first define 
for a graph G = (V, E) its clustering with respect to a partition of V. 

Definition 1. Let G = (V, E) be a graph and V\, . . . ,Vm a (nonempty) parti- 
tion of the vertex set V. The graph with vertex set {wi, . . . ,Wm} and edge set 
{(wi,Wj) I u G Vi,u' G Vj,i 7 ^ j, and {u,u') G E} is called the clustering of G 
with respect to the partition Vi, , Vm • 

Intuitively, the clustering of G is constructed from G by first shrinking every 
subgraph G\Vf\ into a single vertex Wi and then merging multiple edges. 

Definition 2. LetG = {V,E) be a graph and X andY two graph classes. Graph 
G is an X -graph of Y -graphs (X of Y graph), if there is a partition Vi, . . . , Vm 
of the vertex set V, such that 

1. the subgraphs G[Vi], . . . , G[Vm] are from the class Y, and 

2. the clustering of G with respect to the partition Vi, . . . , Vm is from the class 

X. 

In this paper we restrict ourselves to tree of paths and tree of cycles. Every 
such graph G has a tree clustering T. For reasons of readability we distinguish 
between nodes of T and vertices of G. Every node of T represents a path (cycle) 
in G. We will identify a node and its representing path (cycle) in G. A branch 
of T at w is the maximal subtree of T containing w and exactly one neighbour 
of w in T. Notice that the number of branches of T at w equals the degree of w 
in T. If the paths (cycles) of a tree clustering T have maximal length k then T 
will be denoted by T^. 

As examples notice the following facts: Obviously, every tree is a tree of 
paths, and every 2n x 2n-grid is a tree (path) of cycles. To the contrary every 
clique with more than 6 vertices is neither a tree of paths nor a tree of cycles. 
Furthermore, every cycle of length n is a tree (path) of paths, but it has no tree 
clustering of paths with k < |"n/2] . 
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3 Main Results 

In this section we deal with the problems whether or not a graph G is a tree of 
paths (ToP problem) and whether or not G is a tree of cycles (ToC problem). 
The problems are formally defined as follows: 

Name'. ToP problem. Name: ToC problem. 

Instance: A graph G. and Instance: A graph G. 

Question: Is G a tree of paths? Question: Is G a tree of cycles? 

If the length of the paths, respectively the length of the cycles is bounded by 
some integer k, then we call these problems k-ToP, respectively k-ToC. 

For our purpose we need a variant of the ONE-IN-THREE 3SAT problem 
(see 0, L04) in which the instances meet the following requirements: (i) no 
clause contains opposite literals, and (ii) if a variable x appears more than once, 
any of its literals x and x appears in at least two clauses. We call this restricted 
variant the r-ONE-IN-THREE 3SAT problem. 

Proposition 1. The r-ONE-IN-THREE 3SAT problem is NP-complete. 

Proof. The proof is a simple reduction from the ’’positive” ONE-IN-THREE 
3SAT problem, which is also NP-complete, see Pj. In this problem the instances 
are restricted to contain only clauses with positive literals. Let (f> be such an 
instance. For every variable x appearing more than once in (/) we add the clauses 
(x V Oaii V aa;J, {x V a ^3 V a^J, (xV V a^g), and (TV V to (j), where 
axi , ■ • ■ , CLxg are new variables. By construction the obtained expression (jf meets 
the desired requirements. Moreover it is easy to show that 4> is satisfiable if and 
only if (f) is satisfiable (in the sense of ONE-IN-THREE 3SAT). 

3.1 Tree of Paths 

In this subsection we first show that the ToP problem is NP-complete. Then we 
give a polynomial algorithm solving the fc-ToP problem. The algorithm can be 
modified easily to output a tree clustering T^ if it exists. 

The key to prove the NP-completeness is the gadget C shown in Figure 0 G 
has an important property which is summarized by the next lemma. 

Lemma 1. Let G be a tree of paths, T a tree clustering of G, and G the graph 
shown in Pigure]^ If C is an induced subgraph of G then the vertices of C belong 
to exactly two different paths Pi and P 2 of T , such that P\ contains two vertices 
of x,y,z and P 2 contains the vertices I, r and the remaining vertex of x,y,z. 

Proof. An easy inspection shows that the vertices x, y, and z may neither belong 
to the same path nor to three different paths of T. Thus x, y, and z belong to 
exactly two different paths P\ and P 2 of T. Without loss of generality we assume 
that Pi contains x and y and path P 2 contains z, see Figure 0a). 

Now consider vertex 1. Let P 3 be the path of T containing 1. Obviously, P 3 is 
different to Pi. If P 3 is also different to P 2 , then the paths Pi, P 2 , and P 3 would 
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not form a tree because they are pairwise connected by edges from G. Hence, 
P 3 and P 2 are identical in T and I belongs to P 2 - For symmetry the same holds 
for vertex r. 

Theorem 1. The ToP problem is NP-complete. 

Proof. Obviously the ToP problem is in NP. We show that it is NP-hard. The 
proof is a reduction from the r-ONE-IN-THREE 3SAT problem. Let (f be an 
instance with n clauses C\, . . . ,Cn- We construct a graph G{4>) and we show 
that (j) is satisfiable if and only if G{(f)) is a tree of paths. 

The key to our construction is the graph C of Figure El We create n copies 
Cl, ... , Cn of C side by side, one for every clause Ct. For convenience we will 
denote the copies of vertices x, y, z, I, r in Gi by xi, pi, Zi, li, rt, respectively. Ver- 
tices Xi^pi and Zi correspond to the literals of the clause Ci. We create a new 
vertex b and connect it with all vertices U and r^. Then we add an edge between 
vertices and U+i, 1 < i < n — 1. Finally, we add an edge between two ver- 
tices in different copies if and only if the vertices correspond to opposite literals. 
Figure 0 illustrates the construction. 

We claim that (p is satisfiable if and only if G{4>) is a tree of paths. In proof, 
suppose that there is a truth assignment satisfying (j>. Thus, every clause has 
two false and one true literal. In every copy Gi the vertices corresponding to 
the false literals are a path in G{<p), denoted by Pi. The vertices k and and 
the vertices corresponding to the true literals in the clauses are also a path in 
G{(p), denoted by P. Finally, the vertex 5 is a trivially single path denoted by 
P' . By construction P' is connected with P which itself is connected with every 
Pi. Since there is no connection between two different paths Pi and Pj (two false 
literals can never be opposite and thus never be adjacent). Pi, . . . , Pn, P, and P' 
form a tree clustering of G{4>). 

Conversely, suppose that G{4>) is a tree of paths whith a tree clustering T. 
Our aim is to define a truth assignment for (p. First, take a closer look at the 
structure of G{(p) to explore some important information about its clustering T. 
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<j> = {xy yV z) f\ {xy uV v) t\ {xV wV x) f\ {xW qW h) 




Figures. Reduction to ToP. The dotted edges connect vertices corresponding 
to opposite literals. 



By Lemma d the vertices Xi,yi, Zi,U,ri of Ci belong to two different paths 
of T, say PI and P21 such that PI contains two vertices from Xi,yi,Zi and P2 
contains the remaining vertices. We show that all paths are identical in T. 
Suppose that and P2 are different in T. Then the vertex b must belong to 
some of them, otherwise P2, P2 and the path containing b are pairwise connected 
and they can not form a tree. But this is a contradiction, because b is connected 
with at least two vertices of P2 and with at least two vertices of P2 ■ Hence, T has 
one path consisting of the vertices h, rt and of exactly one vertex from Xi,yi,Zi, 
\ <i < n. We denote this path by P. Consequently, the vertex b forms its own 
trivial path in G{(p), denoted by P'. Additionally, P and P' are different from 
every path P^ for 1 < j < n, of P. 

Next we show by contradiction, that there are no connections between two 
paths PI and P( in G{(j>). Suppose that there were a connection between some 
vertex pi S PI and some vertex pj € P ( . By construction pi and pj correspond to 
opposite literals, say x and x, respectively. We directly obtain: (i) the paths PI 
and P( are the same path in T ; otherwise P { , P( and P were pairwise different 
and pairwise connected in T which is a contradiction, and (ii) the variable x 
appears more than once. 

By (a) there are two further vertices Ps G Gg and pt G Gt, such that Ps corre- 
sponds to X and pt to x. Consider ps and pt, which are adjacent by construction. 
Their adjacency implies that they may not be both in P; otherwise P would not 
be a path. Thus, one of Ps and pt, say pt, belongs to some path, say Pf , different 
from P. Since pi and pt are connected in G{(f>) (they correspond to opposite 
literals) P{ and Pf are different in T; otherwise pi would have three neighbours 
on the common path, namely Pj,pt and one in P^. But then P{ (containing pi 
and Pj), Pi (containing pt) and P are pairwise different, and pairwise connected 
in T, which is a contradiction. Hence, there exist no connections between two 
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paths P{ and P( in G{4>) and the tree clustering T of G(^) contains exactly the 
paths P{, 1 < i < n, P and P' . 

Now we define a truth assignment ^ of ^ as follows. Let x be a literal of 
variable x. 

We set: 

JT ifP contains a vertex corresponding to x 
^ if some PI contains a vertex corresponding to x. 

It remains to show that A is well defined. Suppose there exists a literal x having 
both values T and F. There are three cases: 

In the first case, P contains two vertices corresponding to opposite literals. But 
then these vertices are adjacent in G((/>) and P would not be a path, which is a 
contradiction. 

In the second case. Pi,..., Pi contain vertices corresponding to opposite lit- 
erals. But this is impossible, because no P^ contains opposite literals, since no 
clause Ci contains opposite literals and because there are no connections be- 
tween Pi,. .. , Pi . 

In the third case, P contains a vertex p and some P{ contains p' both correspond- 
ing to the same literal, say x. We show that this is also impossible: Consider the 
vertex p" of G((/)) corresponding to literal x] such a vertex exists, because the 
variable x appears more than once. If p" is in P then p and p" are adjacent in 
P, which is a contradiction. If p" is not in P then it is in some path P(, which 
contradicts to the fact that there are no connections between P^ and P( and 
no clause contains opposite literals. Hence, A is well defined and the proof is 
complete. 

Next we show that the fc-ToP problem is solvable in polynomial time. Recall 
that the A:-ToP problem is the question whether or not a given graph has a tree 
clustering whose paths have a length bounded by some integer k. 

Remark 1. Although every graph having a tree clustering is a partial 2fc-tree, 
there is no closer relationship between the class of graphs having a tree clustering 
and the class of partial A:-trees. In fact, every cycle with length n is a partial 
2-tree (independent of n), but it has no tree clustering with k < |"n/2] . 

In the following, we give a polynomial algorithm solving this problem. It uses 
a dynamic programming technique and it is similar to the algorithm given in 
for the recognition of partial k-tiees. Its idea is based on the following lemmas: 

Lemma 2. Let k be an integer and G = (V,E) a graph with more than 2k 
vertices. Then G has a tree clustering if and only if there is a path separator 
P of G, such that every connected component of G\V — P\ augmented by P has 
a tree clustering rooted by P . 

Proof. Suppose that G has a tree clustering T^. Since G has more than 2 k 
vertices, has at least three paths. One of them, say P, is a separator of G. 
Consider the branches of at P. Obviously the branches are tree clusterings 
of the connected components of G\V — P] augmented by P. 
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For the only-if-case, let P be a path separator of G, and let Gi, . . . , G; be the 
connected components of G[V — P]. Suppose that every augmented component 
G[Ci LI P], 1 < i < I, has a tree clustering rooted by P. By identifying the 
vertices of P in Tf , . . . , we obtain a tree clustering for the whole graph 

G. 

The next lemma shows how to determine whether an augmented component 
G[Gi U P] has a tree clustering rooted at a path P. Here only tree clusterings of 
augmented components with a smaller size than G[Ci U P] are needed. 

Lemma 3. Let k be an integer, G = {V, E) a graph, and P a path separator 
of G with length at most k. Let C he a eonnected eomponent of G\V — P] and 
G[GUP] he the augmented eomponent of C . Then G[GUP] has a tree elustering 
rooted at P if and only if 

1. C is a path in G of length at most k, or 

2. there exist both a path separator P' o/G with length at most k and connected 

components C{, . . . , Gf, of G\V — P'] partitioning G — P' , such that 

(a) the augmented components G[C[ U P'], . . . , G[G^ U P'] have tree cluster- 
ings rooted at P' , and 

(b) P' contains all neighbours of P in G. 

Proof. Let P be a path separator of G with length at most k, C a, connected 
component of G[V — P], and G[GU P] the augmented component of G. Assume 
G[G U P] has a tree clustering rooted at P. If has two nodes then G 
necessarily is a path in G. If has more than two nodes, consider the son P' 
of P in (notice that P has only one son in T^, otherwise G would be discon- 
nected). P' is a path separator of G of length at most k. Obviously all connected 
components of G\V — P'] contained in G partition G — P'. Let C[, . . . , C'^ be 
these connected components of G\V — P'] contained in G. Furthermore the aug- 
mented components G[C[ UP'],..., G[G'j^ U P'j have tree clusterings rooted at 
P'. To see this, observe that every augmented component G[G' U P'j, 1 < z < to, 
corresponds to exactly one branch of at P' and that every branch represents 
a tree clustering rooted at P'. Finally, P' contains all neighbours of P in G, since 
P' is the only son of P in T^. 

Conversely, first suppose that a connected component G of G[y — P] is a path 
in G. The tree clustering consisting of root P and leaf G is a tree clustering of 
G[GU Pj. Next suppose there exist a path separator P' of G with length at most 
k, and connected components G(, . . . , G'^ of G\V — P'j partitioning G — P' and 
meeting the requirements of (a) and (b). Let Pf , . . . , be the tree clusterings of 
the augmented components G[G(UP'], . . . , G[G(^UP'j all rooted at P'. Consider 
the tree clustering obtained by identifying the nodes representing P' in every 
tree clustering P/', and by adding a new root P connected to P'. Since all 
neighbours of P in G are contained in P', P^ is a tree clustering of G[G U P] 
rooted at P. 

Now we present our algorithm. We can assume that the input graph has more 
than 2k vertices. First we construct the set S of all path separators of G with 




296 



Falk Schreiber and Konstantinos Skodinis 



length at most k. Then for every path separator P from S we find all connected 
components C of G\V — P] and we add tuple {G[C U P],P) to a set A. Next 
we sort the elements of A in increasing order according to the size of their first 
component. Finally, we examine all graphs G[G U P] of all tuples {G[C U P], P) 
from A, from smallest to largest and using Lemma 0 we determine whether or 
not graphs G[C U P] have a tree clustering rooted at P. To save this decision 
we define an array / realizing the boolean function A — > {true, false}. Finally, 
using Lemma 0 we determine whether or not graph G has a tree clustering T^. 

ALGORITHM 



Input: A graph G = (V, E). 
Output: answer = ''yes" or "no" . 



begin 

{ Initialization } 

5 := 0; A := 0; answer :=" no" ; 

{ compute set S } 

for every P CV with |P| < k 

do if (P is a path separator of G) 
then 5 := 5 U {P}; 



{ compute set A and initialize array I } 
for every P G S 

do for every connected component C oi G\V — P] 

do begin 



A 

1 



:=AU{(G[GUP],P)}; 
(G[GUP],P)1 := false] 



end; 

sort elements of A in increasing order according to the size of their first 
component; 



{ examine every element (G[G U P], P) of A in increasing order and 
determine whether graph G[G U P] has a tree clustering rooted at P using 
Lemma 0 } 

for every (G[G U P], P) of A in increasing order 

do begin 

if (G is a path of length at most k in G) 



then / (G[GUP],P) := true; 
else for every P' from S 

do if (3 C[,...,G'^ ofG[V-P' 

then if (/f(G[G' U P'], P') 



partitioning C — P') 
= true for 



every 1 < i < m) and 
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end; 



{P' contains all neighbours of P in C) 



then / 



(G[CUP],P) 



:= true] 



{ determine whether graph G has a tree clustering using Lemma |2|} 
for every P from S 



do if (/ {G[CUP],P) 



true for every G of G[y — P]) 



then begin 

answer := ''yes'" ] 

exit; 

end; 



end. 



Theorem 2. Let k be a fixed integer. The algorithm correctly determines whether 
or not an input graph with size n has a tree clustering in time. 

Proof. The correctness of the algorithm follows directly from Lemmas El and 
0 We examine its complexity. The construction of set S takes 0(n^+^) time. 
There are at most 0{n^) subsets of V with at most k elements and it takes at 
most 0{n^) time to check if such a subset is a path separator of G. Similarly, the 
construction of A and initialization of I take at most time. To determine 

whether an augmented component G[GUP] of G has a tree clustering rooted by 
P, takes at most time. There are at most Ofn^) separators P' and at 

most n connected components of G[V — P']. In addition, it takes at most O(n^) 
time to check whether some of them partition C—P'. The remaining if-conditions 
of the loop can be checked in 0{n) time. Thus, to examine all 0{n'"~^^) tuples 
of A can be done in time. Finally, to determine whether or not graph 

G has a tree clustering takes at most time. Hence, the total time 

complexity of the algorithm is 0(n^^+^). 

Our algorithm does not construct a tree clustering of the input graph, if there 
exists one. But this can be achieved by an obvious modification. 



3.2 Tree of Cycles 

In this subsection we investigate the ToC problem. First we show that it is NP- 
complete by a reduction from the r-ONE-IN-THREE 3SAT problem. The idea is 
very similar to the one used in Theorem 01 Here the key to our reduction is graph 
G shown in Figure El It is the graph G in Figure El of the previous subsection, 
extended by a new vertex t connected with vertices x, y, and z. Similar to Lemma 
0we obtain: 

Lemma 4. Let G be a^ tree of cycles, T a tree clustering of G, and C the graph 
shown in Figure ^ Lf G is an induced subgraph of G and the vertex t has degree 



298 



Falk Schreiber and Konstantinos Skodinis 






Figure4. Graph C and its different cycle clusterings 



3 in G, then the vertices t,x,y, z,l,r of G belong to two different cycles Qi and 
Q 2 ofT, such that Q\ contains vertices t and two of the vertices x,y,z, and Q 2 
contains the remaining vertices. 

Proof. Consider the vertices x,y, and z. They may neither belong to the same 
cycle nor to three different cycles of T: In the first case vertex t alone would 
form a cycle (notice t has degree 3 in G), which contradicts to the definition of 
cycles. In the second case T would be no tree clustering, since x, y, and z are 
pairwise connected. Thus x,y, and z belong to exactly two cycles Qi and Q 2 . 
Without loss of generality we assume that Qi contains x and y and Q 2 contains 
z, see Figure 0(a). 

Consider the vertex t in G. Since its degree is 3, it has to belong to the cycle 
Qi {t alone can not form a cycle in T). Now consider the vertex 1. Let Q 3 be 
the cycle of T containing 1. Trivially, Q 3 is different to Qi. If Q 3 is also different 
to Q 2 in T then Qi,Q 2 , and Q 3 are pairwise connected in G and T would be 
no tree. Hence, Q 3 and Q 2 are identical in G and I belongs to Q 2 . A similar 
argument shows that vertex r belongs also to cycle Q 2 . 



Theorem 3. The ToC problem is NP-complete. 

Proof. Obviously, the ToC problem is in NP. In order to show its NP-hardness 
we reduce the r-ONE-IN-THREE 3SAT problem to it. Since the reduction is 
very similar to the one in Theorem 0 we only sketch our proof. 

Let 4> be an instance of r-ONEHN-THREE 3^ AT. We construct a graph 
G{(j)) as follows: We create n copies Gi, . . . , G„ of G one for every clause Ci. We 
identify vertices Xi, yi and Zi of the i-th copy with the literals of clause Gi. Then 
we connect vertices and h+i, 1 < i < n — 1, as well as vertices l\ and r„. 
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(j) = {xy yy z) /\ {xy uW v) /\ (xM wW x) /\ (xy qW h) 




Figures. Reduction to ToC. The dotted edges connect vertices corresponding 
to opposite literals. 



Finally, if two vertices correspond to opposite literals then we connect these by 
an edge. Our construction is shown in Figure 0 

It remains to show that (j> is satisfiable if and only if G{(f>) has a tree clustering 
T. Suppose ^at there is a truth assignment satisfying (j). Let Qi be the cycle 
of the copy Ci, 1 < i < n, containing vertex ti and the vertices corresponding 
to false literals. Furthermore let Q be the cycle of G{(f>) containing all vertices 
h, fi, ^ < i < n, and the vertices corresponding to the true literals in the clauses. 
Since there is no connection between two different cycles Qi and Qj (two false 
literals can never be opposite and thus never be adjacent), Q and Qi form a tree 
clustering T of G{(j)). 

Conversely, suppose that G{4i) has a tree clustering T. By Lemma E| the 
vertices ti, Xi, yi, Zi, U and of Gi belong to two different cycles Q\ and Q\ 
of T, such that Q\ contains both vertex ti and two vertices of Xi,yi, Zi, and Q 2 
contains both the vertices h,ri and the remaining vertex of Xi,yi,Zi. We show 
that all cycles Q 2 identical in T. It is sufficient to show that for every i Q\ 
and same in T. Consider the vertex of Q\- It must have two 

neighbours in Q^. These neighbours have to be U+i and one of Xi,yi,Zi, because 
of Lemma 0 the remaining two neighbours of Xi have to belong to Q\ which is 
different to Hence, all cycles Q 21 1 ^ ^ represent the same cycle in T, 

denoted by Q. Consequently, T consists of the cycles Q and Q\, 1 <i <n. 

Now we define a truth assignment A of as follows. Let a; be a literal of 
variable x. 

We set: 



..... JT ifQ contains a vertex corresponding to x 
^ 1 F if some Q\ contains a vertex corresponding to x. 

Since there are no connections between two cycles Q\ and Q\ in G{(f) we can 
show analogously to the proof of Theorem 0 that assignment A is well defined. 
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Replacing paths by cycles in the algorithm for the fc-ToP problem (see sub- 
section EU leads to an algorithm solving the fc-ToC problem. 

4 Conclusion 

In this paper we have shown that given a graph G, the problem whether or 
not G is a tree of paths (cycles) is NP-complete. Moreover, if the length of the 
paths (cycles) is bounded by some integer k, we have developed an 
algorithm to solve this problem. Of course our algorithm is not practical. In 
further investigations we will try to improve this result by finding an algorithm 
having a better time complexity. Besides we are interested in the complexity of 
the problem whether or not a given graph is a path of paths, in particular if the 
number of the paths is two. 



Acknowledgment 

We would like to thank Prof. Dr. Franz- Josef Brandenburg for his improvements 
of the representation. 



References 

[1] S. Arnborg, D. Corneil, and A. Proskurowski. Complexity of finding embeddings 
in a k-tree. SIAM J. Alg. Disc. Meth., 8:277-384, 1987. 

[2] F. T. Boesch and J. F. Gimpel. Covering the points of a digraph with point- 
disjoint paths and its application to code optimization. J. Assoc. Comput. Mach., 
24(2):192-198, 1977. 

[3] F. J. Brandenburg. Graph clustering I: Gycles of cliques. In G. DiBattista, editor, 
Proc. of Graph Drawing, volume 1353 of Lect. Notes in Comput. Sci., pages 158- 
168. Springer- Verlag, New York/Berlin, 1997. 

[4] E. Cohen and M. Tarsi. NP-completeness of graph decomposition problems. J. 
Complexity, 7:200-212, 1991. 

[5] D. Dor and M. Tarsi. Graph decomposition is NP-complete: A complete proof of 
holyer’s conjecture. SIAM J. Comput., 26(4):1166-1187, 1997. 

[6] P. Eades and Q.-W. Feng. Multilevel visualization of clustered graphs. In S. North, 
editor, Proc. of Graph Drawing, volume 1190 of Lect. Notes in Comput. Sci., pages 
101-112. Springer- Verlag, New York/Berlin, 1996. 

[7] Q.-W. Feng, R. F. Cohen, and P. Eades. How to draw a planar clustered graph. 
In D.-Z. Du, editor. Computing and Combinatorics, volume 959 of Lect. Notes in 
Comput. Sci., pages 21-30. Springer- Verlag, New York/Berlin, 1995. 

[8] T. Fruchterman and E. Reingold. Graph drawing by force-directed placement. 
Software - Practice and Experience, 21(11):1129-1164, 1991. 

[9] M. R. Garey and D. S. Johnson. Computers and Intractability; A guide to the 
theory of NP-completeness. W. H. Freeman, 1979. 

[10] M. Himsolt. The graphlet system. In S. North, editor, Proc. of Graph Drawing, 
volume 1190 of Lect. Notes in Comput. Sci., pages 233-240. Springer- Verlag, New 
York/Berlin, 1996. 



NP-Completeness of Some Tree-Clustering Problems 301 



[11] I. Holyer. The NP-completeness of some edge-partition problems. SIAM J. Corn- 
put, 10(4):713-717, 1981. 

[12] J. Kratochvfl, M. Goljan, and P. Kucaera. String graphs. Technical report, 
Academia Prague, 1986. 

[13] C. Papadimitriou. Computational Complexity. Addison- Wesley Publishing Com- 
pany, 1994. 

[14] T. Roxborough and A. Sen. Graph clustering using multiway ratio cut. In G. Di- 
Battista, editor, Proc. of Craph Drawing, volume 1353 of Lect. Notes in Comput. 
Sci., pages 291-296. Springer- Verlag, New York/Berlin, 1997. 

[15] K. Sugiyama, S. Tagawa, and M. Toda. Methods for visual understanding of hier- 
archical systems. IEEE Transactions on Systems, Man and Cybernetics, 11:109- 
125, 1981. 




Refinement of Orthogonal Graph Drawings * 



Janet M. Six,, Konstantinos G. Kakoulis, and loannis G. Tollis 

CAD & Visualization Lab 
Department of Computer Science 
The University of Texas at Dallas 
P.O. Box 830688, EC 31 
Richardson, TX 75083-0688 
{Janet .kostant , tollis}@utdallas . edu 



Abstract. Current orthogonal graph drawing algorithms produce draw- 
ings which are generally good. However, many times the readability of 
orthogonal drawings can be signihcantly improved with a postprocess- 
ing technique, called refinement, which improves aesthetic qualities of 
a drawing such as area, bends, crossings, and total edge length. Rehne- 
ment is separate from layout and works by analyzing and then fine-tuning 
the existing drawing in an efficient manner. In this paper we define the 
problem and goals of orthogonal drawing refinement and introduce a 
methodology which efficiently rehnes any orthogonal graph drawing. We 
have implemented our technique in C-I--I- and conducted preliminary ex- 
periments over a set of drawings from five well known orthogonal drawing 
systems. Experimental analysis shows our technique to produce an aver- 
age 34% improvement in area, 22% in bends, 19% in crossings, and 34% 
in total edge length. 



1 Introduction 



Orthogonal graph drawings represent nodes with boxes and edges with polygonal 
chains of horizontal and vertical line segments which reside on an underlying grid. 
Drawings in this style are useful for applications which benefit from high clarity 
representations. Much research has been conducted in this area and various 
algorithms exist to produce orthogonal drawings of planar , general 

maximum degree four mm. and general higher degree graphs I2C3I2H. An 
extensive experimental study was conducted by Di Battista et. al. PJj where four 
general purpose orthogonal drawing algorithms were implemented and compared 
with respect to area, bends, crossings, edge length, and running time. 

Many papers have suggested ways of evaluating the “goodness” of a graph 
drawing (for example |8fl Iff 7f20f‘2S] ) in addition to the standard measures of 
area, crossings, bends, and edge length which are used as a means to evaluate the 
quality of a graph drawing algorithm. The achievement of many of these goals. 
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aesthetics and constraints is known to be NP-Hard. Complicating this issue is the 
experience that maximizing one particular quality of a drawing causes another 
to be significantly poor since some of these qualities work against each other. 
Therefore most algorithms try to layout the graph in a manner which is good 
for some set of aesthetics. 

Current orthogonal graph drawing algorithms produce drawings which are 
generally good. However, many times the readability of orthogonal drawings 
can be significantly improved with a postprocessing step which modifies the 
positions of nodes, edges, and bends. It is vital that drawings which are created 
by any graph layout system are very readable. Refinement is a postprocessing 
methodology which can be applied to any orthogonal drawing and improves 
readability by analyzing the drawing and then fine-tuning it while keeping the 
majority of the layout intact. The result is a new drawing which has improved 
aesthetic qualities including area, bends, crossings, and edge length. Previous 
work includes compaction strategies tiimm and movement of stranded nodes 
m- However, the scope of these postprocessing techniques is limited. A more 
sophisticated methodology is needed to further improve the aesthetic qualities 
of graph drawings. We have focused on the development and implementation 
of several efficient refinement modules which work on any orthogonal drawing 
(including degree greater than four). An example of a drawing before and after 
refinement is shown in Figure E 




Fig. 1. One of the Rome graphs as drawn with GIOTTO on the left. The same 
drawing after refinement on the right (same scale). There is a 29% improve- 
ment in area, 13% improvement in the number of bends, 11% in the number of 
crossings, and 24% in the total edge length. 



There are two types of refinement: interaetive and static. During the inter- 
active refinement of drawings we must maintain the user’s mental map ra and 
are allowed to make only minimal changes. The requirements for the refinement 
methodology are very similar to those of interactive graph drawing algorithms 
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The fact that the user has already seen a drawing of a graph 
means that the refinement technique must not make changes so drastic that 
pieces of the drawing are not recognizable. Static refinement fine tunes drawings 
for which we do not have to maintain the user’s mental map: we are free to make 
any change in the embedding. Certainly refinement cannot be a cure for a very 
poor layout because this would require the essential invocation of some other 
layout algorithm. Refinement fine-tunes an existing drawing by improving some 
layout qualities. Our tool performs static refinement on any orthogonal graph 
drawing. 

Our refinement technique produced a significant 19% to 34% average im- 
provement for each of the generally accepted characteristics area, bends, cross- 
ings, and total edge length in preliminary experiments over drawings from five 
algorithms. Since different applications require different classes of drawings and 
therefore need to focus on varying kinds of refinement, our system has the flex- 
ibility to vary the types and order of refinement modules called, so that a user 
may refine drawings in a manner specific for a particular application. 



2 Refinement 

During a survey of orthogonal drawings from a variety of sources, we repeatedly 
observed extra area, bends, edge crossings, and edge length caused by U-turns 
in edges (as described in IE]), superfluous bends, poor placement of degree two 
nodes, two incident edges of a node crossing, nodes stranded very far from their 
neighbors, and unused space inside the drawing. See Figure0for examples. Note 
that the attachment points of the left three vertical edges in the self crossing 
example, Figure EP, are not moved. The node is extended to the left allowing 
the placement of the edge with the bend. Then the right side of the node is 
contracted since the space of the old edge placement is no longer needed. 

Specifically, U-Turns are three contiguous edge sections which form a “U” 
shape with the middle section pulled far from the source and target nodes of 
those three sections. Superfluous bends are those which exist even if there is 
room in the space of the drawing for an edge with less bends. Clearly U-Turns 
and superfluous bends can occur multiple times in edges which have four or 
more sections. Poorly placed degree two nodes are those which are neither on 
a bend nor in the midst of its two incident edges. Self crossings are those which 
occur between two edges incident to the same node. Self crossings are divided 
into two categories: near and far self crossings. Near self crossings are those 
whose positions differ from that of the node in either the horizontal or vertical 
orientation. Far self crossings differ in both orientations. A stranded node is a 
degree one node which is placed very far from its neighbor. 

Fixing a set of the above defined problems with a sequence of refinements 
will certainly reduce the visual complexity of the drawings, however we take our 
methodology one step further. If we perform these refinements directly on the 
given drawing we will improve the quality of the drawing, but will still miss some 
of those improvements which are less visually obvious. Therefore we preprocess 




Refinement of Orthogonal Graph Drawings 305 




Fig. 2. Examples of the problems we solve with a refinement technique: A: U- 
turns, B: superfluous bends, C: poor placement of degree two nodes, D: self- 
crossings, E: stranded nodes, and F: extra area. 



the given graph into a simpler one. First we absorb all chains of degree two 
nodes into an edge and then denote each degree one node to be a super node and 
determine the minimum distance needed between it and its neighbor (as is also 
done in jSEl)- All refinement operations are performed on this simplified graph. 
The refinement techniques have been implemented to acknowledge the presence 
of super nodes and edges and place them in some appropriate manner. After 
refinement is complete the preprocessing is undone in order to restore the graph 
to its original topology. The preprocessing operations allow our methodology to 
catch significantly more of the above problems and therefore produce better quality 
drawings. 

Procedure: Refine-Orthogonal-Graph-Drawing 
Input: An orthogonal graph drawing, T, of a graph, G 

Output: A new orthogonal drawing, T' , of G with a lower visual complexity 

1. Build the abstracted graph, G', of G, such that 

(a) Each chain of degree two nodes is abstracted into a single edge. 

(b) Each degree one node is denoted to be a super node and the minimum 
necessary distance between it and its lone neighbor is calculated. The 
minimum distance is directly proportional to the number of absorbed 
degree two nodes in the lone incident edge. 

2. For each edge, e, in G' 

(a) If e contains a sequence of three edge segments which form a U-Turn 
edge, then pull in the middle segment of that sequence so that it is as 
close as possible to its source and target. 

(b) If e contains a sequence of three edge segments which have an extra 
bend and there exists room for a lower bend edge routing in the drawing 
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then replace the current routing with the lower bend solution. Use the 
bend-stretching transformations of m- 

(c) If e is an edge which represents a chain of degree two nodes in G, post- 
process this edge to restore the degree two nodes and verify that each 
degree two node is either on a bend or in the midst of its two incident 
edges. 

3. For each node, v, in G' 

(a) If V has a near self crossing, expand the node by one row or column in 
the appropriate direction and move the attachment point of the trouble 
edge to that new row or column. Place any abstracted degree two nodes 
so that they do not occlude any other drawing element. If v has a far 
self crossing and is degree two, then place the node at the location of the 
crossing. Otherwise add one row or column and break the crossing into 
a knock-knee CHI edge routing. Both of these far self crossing solutions 
swap the attachment points of the crossing edges at v so that neither of 
the neighbor nodes is moved. 

(b) If u is a stranded node, place it as close to the neighbor node as space 
allows in F. 

4. Perform a VLSI layout compaction miBi to remove extra space in the 

drawing. 

5. Undo the preprocessing of step 1. □ 

Many improvements may be made without increasing the area of the drawing, 
but allowing the addition of more area may enable refinement to significantly 
improve other aesthetics of the drawing. For example, adding a row or column 
may be necessary to remove a self crossing. However this allowance should be 
according to user requirements and must be parametrically defined. 

It is important to emphasize that refinement is an evolving process. We are 
planning to implement additional modules for improving orthogonal drawings 
as we discover and develop further techniques. 

3 Implementation and Experimental Results 

3.1 Implementation 

Refine-Orthogonal-Graph-Drawing has been implemented in GNU G-l— I- Version 
2.7.2 with Tom Sawyer Software’s Graph Layout Toolkit Version 2.3.1 and a 
TGL/TK graphical user interface. A set of experiments has been run on a Sparc 
5 running Sun OS Release 4.1.3. 

Many interesting and challenging issues were addressed during the imple- 
mentation of Refine-Orthogonal-Graph-Drawing. First we needed a mechanism 
to search the space within the given drawing to move pieces of the drawing with- 
out occluding uninvolved elements. We represent the space of the drawing with 
a dynamic orthogonal grid structure in which rows and columns may be added 
at any point within the space. Elements of the drawing are represented with grid 
segments owned by nodes, edges, and bends. 
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Each of the refinement modules can be viewed as a local searching technique. 
The module which shortens U-Turn edges looks at the endpoints of the middle 
edge and places them as close as possible to their neighbors. If the new placement 
of the middle segment is still orthogonal and does not occlude any drawing 
elements we are done. Otherwise we search the space toward the old placement 
until sufficient space is found for the middle segment. At worst, this will be the 
old placement. See Figure 0 It is important to note that the edges involved in 
the U-Turn may actually represent a chain of degree two nodes, therefore we 
must detect that situation and be sure to place those nodes only where there 
is sufficient space in the grid. We iteratively place each degree two node closer 
to its neighbor. Also, we examine each set of three contiguous segments in each 
edge so that we can catch more of the U-Turns. This is especially important 
when we are really dealing with degree two chains. 



D- 





Fig. 3. Fixing a U-Turn edge. The left illustration shows the involved elements 
in their original positions. The endpoints of the middle segment are shown with 
circles and the source and target of the U-Turn with boxes. The right illustration 
shows the final placement of the middle edge. 



The superfluous bends module examines each set of three contiguous edge 
segments in every edge. For each set, call one endpoint of the middle edge seg- 
ment X and the other y. Define a and b to be the points shown in Figure 0 If 
space in the drawing allows, then place a; at a or y at b. 




X 

y 



Fig. 4. Superfluous bend solutions. The old edge routing is shown with a dashed 
line in the two illustrations on the right. 



The self crossing module inserts rows or columns as specified in Step 3 of Pro- 
cedure Refine-Orthogonal-Graph-Drawing. Since the grid is dynamic, we insert 
the new gridlines inside the node to fix a near self crossing on the appropriate 
side and that automatically forces the node to grow. For far self crossings of a 
degree three or higher node, a row or column is inserted at that crossing (see 
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Figure 0 for examples). The first far self crossing solution is for the case where 
the node has degree two while the second solution is for higher degree nodes. The 
second solution adds two bends and therefore the superfluous bends refinement 
module should be run on the new drawing to remove these extra bends if possi- 
ble. Some users may not wish to save a crossing at the potential cost of a new 
row and two additional bends, therefore the user is given a parametric option 
whether or not to use this particular solution. Also note that in a far crossing the 
attachment points of the two crossing edges are swapped so that the positions 
of those neighbors are not changed. If the movement of the attachment points 
is not acceptable, the user may set a parametric option preventing this action. 







Fig. 5. Near and far self crossing solutions. Old node placements and edge rout- 
ings are shown with dashed lines. 



The stranded node module removes involved node and edge segments from 
the grid and searches the space from the neighbor node out. If the lone incident 
edge actually represents a chain of degree two nodes, this process is iteratively 
carried out for each degree two node from the original node out to the stranded 
node, see Figure 0 

The user also has a parametric option for this module to allow the addition 
of rows and columns as necessary to avoid adding any crossings or to allow 
crossings to remain. This option gives the user the ability to decide to give 
priority to area or crossing reduction. We believe that reducing the number of 
crossings is paramount and chose the first parametric option for our experiments. 
In part, this decision was influenced by the study presented in which showed 
the number of crossings to have a very significant effect on readability. This is 
certainly not to say that avoiding crossings will always cause the addition of 
area. In fact, in many drawings, the area will still be reduced while avoiding 
crossings. 

The module which Axes poor placement of degree two nodes, first postpro- 
cesses edges which represent degree two chains and then visits each newly re- 
stored degree two node to verify that it lies either on a bend or in the midst of 
its two incident edges. 

The implementation of the compaction module is inspired by one dimensional 
VLSI layout compaction PITO- A one dimensional graph-based compaction 
is performed once each for the horizontal and vertical directions. Similar types 
of compaction modules have been used in mm- 
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Fig. 6. Fixing a stranded node with an edge that represents a degree two chain. 
Notice that not only are we placing related nodes closer to each other, but also 
unwrapping a poorly routed edge. 



It is recognized that different users may want different types of drawings and 
need to refine a specific aesthetic quality. Also different algorithms merit the use 
of different types of refinement: drawing algorithms inherently have strengths 
and weaknesses with respect to different aesthetic criteria. Static orthogonal 
graph drawing algorithms either planarize and then embed with a planar al- 
gorithm or proceed in an essentially incremental fashion. While the problems 
described in the previous section occur with all of the orthogonal algorithms 
surveyed, some types of problems occur more frequently in a particular class of 
orthogonal algorithms. For example, planarization algorithms have a tendency to 
have some very long edges and place nodes far away from their neighbors. Incre- 
mental algorithms tend to have superfluous bends. Our refinement methodology 
automatically detects these problems and fixes them regardless of the class of 
algorithm used to create the drawing. Furthermore, our implementation com- 
municates with the user via a graphical user interface which allows the user to 
perform the desired set of refinements in any order. This flexibility adds power 
to refinement in that the user can refine any orthogonal drawing in a manner 
which is application specific. 

We have designed and implemented all of our refinement techniques to take 
linear time with respect to the number of grid segments. The number of grid 
segments is bounded by the number of nodes and edges and hence refinement 
takes 0{n + m) time, where n is the number of nodes and m is the number of 
edges. 



3.2 Experimental Results 

We have conducted a set of preliminary experiments with our implementation. 
The source drawings are layouts of Rome graphs (available at http : / / www . 
inf . uniromaS . it/people/gdb/wpl2/L0G.html) produced by the Giotto, Bend- 
Stretch, Column, and Pair algorithms as implemented at Brown University’s 
Graph Drawing Server 0 (http : / /loki . cs .brown. edu: 8081/graphserver), 
and Tom Sawyer Software’s Graph Layout Toolkit version 2.3.1. Each drawing 
was given as input to our refinement implementation and data collected as to 
the improvements made in area, bends, edge crossings, and total edge length for 
each drawing. A table summarizing the average percent improvement for this 
set of aesthetics over drawings from the five layout algorithms is given next. 
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In the table, the left column of percentages for each algorithm represents the 
average improvement over all drawings. This includes the drawings for which our 
technique is unable to improve the drawing with respect to that aesthetic. The 
second column of percentages for each layout technique represents the average 
percent improvement for those drawings which our technique was able to strictly 
improve. The row marked 5, represents the average percent improvement made 
with respect to all four aesthetics for that particular algorithm. A represents the 
average percent improvement made for area, bends, crossings or edge length over 
all of the experimental source drawings. Note that refinement acts on the input 
drawing which is produced by a specific implementation of a chosen algorithm. 
As such, refinement improves aesthetic qualities caused by possible problems of 
both a chosen algorithm and the implementation of that algorithm. 

Our implementation of refinement makes a 34% improvement on average 
in both area and total edge length. This huge improvement is due largely to 
the modules of our technique which shorten long edges. As it is well known in 
the VLSI circuit design field, the area is usually dominated by the amount of 
wiring. Likewise, the area of graph drawings is usually dominated by the edge 
lengths. So our methods which shorten the total edge length also inherently 
decrease the area. Hence we see a proportional improvement in both area and 
total edge length. In addition the area is also reduced by the compaction phase. 
Although several orthogonal layout algorithms already use a compaction phase, 
our compaction has such a significant effect since other phases of refinement have 
simplified the drawing by reducing its geometric complexity. Therefore, pieces of 
the drawing have more freedom to move and can therefore be compacted more 
efficiently. 

Our experiments also show about 20% improvement on average with respect 
to bends and crossings. This is due to the modules which particularly refine 
those elements. 

Refinement significantly improves drawings created by each of these orthogo- 
nal algorithms. As mentioned earlier, every layout algorithm has a set of strengths 
and weaknesses. These strengths and weaknesses with respect to area, bends, 
crossings and total edge length are apparent in the numbers collected for each 
algorithm in our experiments. Giotto is the most evolved implementation of the 
Graph Drawing Server (GDS) and thus refinement has a lesser impact on these 
drawings. One of the main steps of Giotto finds the minimum number of bends 
of the planarized graph I2g. Our refinement improved the number of bends by 
an average 2%. When our tool was able to improve the number of bends, the 
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improvement was on the average 8%, with some improvements up to 50%. The 
planarization step of Giotto causes some nodes to be placed far from their neigh- 
bors, hence we see a more significant improvement of 21% with respect to total 
edge length. 

Likewise we notice similar behavior with Tom Sawyer Software’s Graph Lay- 
out Toolkit (GLT). Their implementation allows each edge to have at most one 
bend. So the average is 11% compared to the average 22% improvement in the 
number of bends over the entire experiment set. Golumn and Pair drawings ex- 
perience very significant improvements of each aesthetic. Especially notice the 
average 40% and 37% improvements in the number of bends and the average 
51% and 45% improvements in total edge length. This is related not only to the 
nature of these algorithms, but also to the implementation of these algorithms 
in the GDS. As discussed above, the quality of a drawing before refinement 
depends heavily on a chosen algorithm, and even more heavily on the implemen- 
tation of the algorithm. Refinement fixes problems caused by both the algorithms 
and their implementations. This is evidenced by the behavior of refinement on 
Golumn and Pair drawings. Instead of dividing input graphs into biconnected 
components and performing a layout on each component, the GDS implementa- 
tions of Golumn and Pair augment graphs to make them biconnected and then 
perform the layout on the augmented graph. The augmenting edges are removed 
during the final step and the resulting drawing shows only the input graph. This 
implementation decision has increased the geometric complexity of many Gol- 
umn and Pair drawings. Our refinement technique provides a 9% to 30% average 
improvement of Bend-Stretch drawings for all aesthetics considered. 

It is important to note the difference between the “All” percentages and the 
“Better” percentages. All five of these orthogonal algorithms produce good draw- 
ings: sometimes the number of crossings and bends is already optimal. Of course, 
the refinement tool cannot reduce the number of bends in a drawing which al- 
ready has the lowest possible number of bends. Also, some layouts do not allow 
the embedding to be refined much. Hence, the “All” percentage improvement is 
lower than the strictly “Better” improvement. 

Example drawings from Bend-Stretch, Golumn, Pair and Tom Sawyer’s GLT 
along with their refined drawings are included in the following pages. Such an 
example for Giotto appears in Figure [D 

4 Conclusions and Future Work 

In this paper we presented an efficient postprocessing technique to improve aes- 
thetic qualities of any orthogonal graph drawing. Specifically we focused on re- 
ducing the area, number of bends, number of crossings and total edge length. A 
preliminary experimental study conducted over a set of drawings from five well 
known algorithms produced very good results. An average 34% improvement 
was made in area, 22% in bends, 19% in crossings, and 34% in edge length. 

Refinement is an evolving technique. We plan to implement more modules 
of refinement as we develop further techniques to improve orthogonal graph 
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Fig. 7. One of the Rome graphs as drawn with Column on the left. The same 
drawing after refinement on the right (same scale). There is a 47% improve- 
ment in area, 42% improvement in the number of bends, 40% in the number of 
crossings, and 56% in the total edge length. 



drawings. Also, we plan to further enhance the graphical user interface of our 
refinement tool so that the refinement process can be even more tailored to 
an individual user’s needs. More parametric options, such as the one for the 
stranded node module which sets a priority for crossings or area, will be added. 
Further, we would like to capture the user’s modal interactions with the drawing 
elements so that we can further improve the quality of given drawings to some 
application-specific standard. 



5 Acknowledgments 

The authors would like to thank Ugur Dogrusoz and Therese Biedl for their 
technical advice and Tom Sawyer Software, especially its President and CEO 
Brendan Madden, for making their software available to implement our refine- 
ment modules. We are also grateful to Stina Bridgeman, Roberto Tamassia and 
the Graph Drawing group at Brown University for the Graph Drawing Server 
which is a great resource. 



Refinement of Orthogonal Graph Drawings 313 





Fig. 8. One of the Rome graphs as drawn with Bend-Stretch on the left. The 
same drawing after refinement on the right (same scale). There is a 34% im- 
provement in area, 24% in the number of bends, 33% in the number of crossings, 
and 39% in the total edge length. 





Fig. 9. One of the Rome graphs as drawn with Tom Sawyer Software’s GLT 
2.3.1 on the left. The same drawing after refinement on the right (same scale). 
There is a 34% improvement in area, 40% improvement in the number of bends, 
100% in the number of crossings, and 36% in the total edge length. 
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Abstract. The general map labeling problem consists in labeling a set 
of sites (points, lines, regions) given a set of candidates (rectangles, cir- 
cles, ellipses, irregularly shaped labels) for each site. A map can be a 
classical cartographical map, a diagram, a graph or any other figure that 
needs to be labeled. A labeling is either a complete set of non-conflicting 
candidates, one per site, or a subset of maximum cardinality. Finding 
such a labeling is NP-hard. 

We present a combinatorial framework to attack the problem in its full 
generality. The key idea is to separate the geometric from the combina- 
torial part of the problem. The latter is captured by the conflict graph 
of the candidates and by rules which successively simplify this graph 
towards a near-optimal solution. 

We exemplify this framework at the problem of labeling point sets with 
axis-parallel rectangles as candidates, four per point. We do this such 
that it becomes clear how our concept can be applied to other cases. We 
study competing algorithms and do a thorough empirical comparison. 

The new algorithm we suggest is fast, simple and effective. 

1 Introduction 

Map labeling is a classical problem of cartography. Since the first attempts of 
automating map production, an abundance of approaches has been applied to 
this problem: expert systems, 0-1 integer programming, and simulated annealing 
to name only a few. Map labeling is usually divided into point, line and area 
labeling. In recent years, especially the problem of point labeling has achieved 
some attention in the algorithms’ community. Two interesting sub-problems have 
been studied. In both cases, an instance consists of a set of sites and a set of 
label candidates for each site. 

1. The Label Size Maximisation Problem: Find the maximum factor a such 
that each site gets a label stretched by this factor and no two labels overlap. 
Compute the corresponding complete label placement. 

* This work was supported by the Deutsche Forschungsgemeinschaft (DFG) under 
grants Wa 1066/1-1, 3-1, and 3-2 
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2. The Label Number Maximisation Problem: Find a maximum subset of the 
sites, and for each of these sites a label from its set of candidates, such that 
no two labels overlap. 

The decision versions of both problems are NP-hard |fW^FFt 8T| ; size 
maximisation only if the sites have more than two label candidates. In the fol- 
lowing we assume that a label candidate touches its site. This makes it easier to 
match label and site. 

There is an approximation algorithm which maximises the size of uniform 
axis-parallel square labels. It is optimal in respect to both, its approximation 
factor of 1/2 and its running time of 0(n log n) |FW91 1 [Wag94| . For the same 
problem, there is an algorithm which keeps the theoretical optimality of the 
approximation algorithm, but performs close to optimal in practice |W W97j . 

For square labels of arbitrary orientation and for circular labels there are 
approximation algorithms maximising label size, again under the restriction that 
all labels are uniform, i.e. of equal size jDMM+9^ . In the same paper, Doddi 
et al. suggest a bicriteria algorithm which mediates between the two problems 
mentioned above. Given an £ > 0, the algorithm labels at least a (1 — e) - fraction 
of the points with axis-parallel uniform square labels of size at least OPT /(1-I-e), 
where OPT is the edge length of the squares in an optimal solution of all points. 
The algorithm puts 1/e equidistant markers on each label edge and places the 
label such that one of the markers coincides with the point to be labeled. 

The complexity of the label number maximisation problem is quite different. 
Even for axis-parallel rectangular labels of arbitrary height and width, there is 
an approximation algorithm, however with a ratio of just l/0(logn) |AvKS97) . 
If the label height (or width) is fixed though, the problem can be approximated 
by a factor of 1/2 in 0(n log n) time. For maximising the size of uniform rect- 
angular labels, this approximation factor is optimal, but for maximising the 
number of fixed-height labels, Agarwal et al. also presented a polynomial time 
approximation scheme (PTAS) in the same paper. 

If fixed-height rectangular labels are allowed to touch their sites anywhere 
on the rectangle boundary, there is still a PTAS and an 0(n log n) algorithm 
that guarantees to label at least half the number of sites labeled in an optimal 
solution |vKSW9Sj. 

Very recently, Kakoulis and Tollis suggested a more general approach to la- 
beling (EI2B|. They compute the candidate conflict graph and its connected 
components. Then they use a heuristic similar to the greedy algorithm for max- 
imum independent set to split these components into cliques. Finally they con- 
struct a bipartite “matching graph” whose nodes are the cliques of the previous 
step and the sites of the instance. In this graph, a site and a clique are joined 
by an edge if the clique contains a candidate of the site. A maximum cardinality 
matching yields the labeling. Due to the last step, their algorithm takes 0{ky/n) 
time in practice. 

The algorithm for label number maximisation we present in this paper has 
the following advantages compared to previously suggested algorithms. Our al- 
gorithmic approach 
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• does not depend on the shape of labels, 

• can be applied to point, line, or area labeling (even simultaneously) if a finite 

set of label candidates has been precomputed for each site, 

• is easy to implement, 

• runs fast, and 

• returns good results in practice. 

The input to our algorithm is the conflict graph of the label candidates. 
The algorithm is divided into two phases similar to the first two phases of the 
algorithm for label size maximisation described in IWWD7I . In phase I, we apply 
a set of rules to all sites in order to label as many of them as possible and to 
reduce the number of label candidates of the others. These rules do not destroy 
a possible optimal placement. Then, in phase II, we heuristically reduce the 
number of label candidates of each site to at most one. 

For the rules we apply in phase I, there is a more general concept discussed 
in the artificial intelligence community under the name constraint satisfaction 
which was independently introduced into the discrete mathematics community 
by Knuth and Raghunathan under the name problem of compatible representa- 
tives EBS21. The difference of our approach to that of the artificial intelligence 
community is that we try to maximise the number of variables (sites) with a 
conflict-free assignment, while their objective is to either list all assignment tu- 
ples without conflicts |MF85j , to minimise the number of conflicts , or to 

find the maximum weighted subset of constraints which still allows an assign- 
ment. 

This paper is structured as follows. In Section 0we describe our ideas within 
the framework of constraint satisfaction. In Section 0 we specialise this general 
concept to the context of point labeling and give the details of our two-phase 
algorithm. The rules of phase I are derived from the general case. In Section 2]we 
describe the set-up and the results of our experiments. We compare our algorithm 
to two other methods, namely simulated annealing and a greedy method. 

Part of the examples, on which we do the comparison, are benchmarks that 
were already used in [WWTFTj to evaluate the algorithm that maximises the size 
of uniform square labels. We added examples for placing rectangular labels of 
varying size, both randomly generated and from real world data. Our samples 
come from a variety of sources; they include the location of some 19,400 ground- 
water drill holes in Munich, 373 German railway stations, and 357 shops. The 
latter are marked on a tourist map of Berlin, which is labeled on-line by our 
algorithm. The algorithm is also used by the city authorities of Munich to label 
their drill-hole maps. All example generators, real world data and algorithms are 
available on the World Wide WelEl 

Our tests differ from experiments performed by other researchers |Hir82l 
IGMS95|IGFMS971 IvKSW 9MIKT98| in that we included example classes where we 
could measure our results with respect to tight bounds on the optimal solution. 



Refer to http://www.inf.fu-berlin.de/map-labeling/ 
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2 Framework 

A constraint satisfaction problem (CSP) is defined as follows. Given a set of 
n variables vi, . . . ,Vn, each associated with a domain and a set of relations 
constraining the assignment of subsets of the variables, find all possible n-tuples 
of variable assignments that satisfy the relations mm . Often variable domains 
are restricted to discrete finite sets, and only binary relations are considered. 

Graph colouring is a special case of a GSP where the variables are nodes, the 
domains a given set of colours, and binary relations express the fact that a node 
cannot have the same colour as any of its neighbours. Since graph colouring is 
NP-complete, one cannot expect to solve general GSPs in polynomial time. For 
this reason, the class of network consistency algorithms has been invented. These 
algorithms use local arguments to exclude values from the domain of a variable 
that cannot be part of a global solution. Network consistency algorithms can be 
seen as a preprocessing step to backtracking since they often reduce the search 
space very effectively. 

An m-consistency algorithm removes all inconsistencies among m of the given 
n variables. In the special cases of m = 1, 2, and 3, these algorithms are called 
node, arc, and path consistency algorithms, respectively. Mackworth and Freuder 
have shown that arc consistency can be achieved in 0{a?k) where a is the size 
of the variable domains and k the number of binary relations IMEHni. 

This framework can be used nearly one-to-one for attacking the label size 
maximisation problem. When maximising simultaneously the sizes of all labels, 
one can do a binary search on conflict sizes, i.e. label sizes for which label can- 
didates start to touch. For each conflict size, one then tries to find a complete 
labeling. Obviously, a site can be seen as a variable, the set of label candidates of 
a site then corresponds to the variable domain and intersections between label 
candidates are the constraining binary relations. Instead of computing all satis- 
fying variable assignments, finding one is usually sufficient in the map labeling 
context. This allows to reduce the search space dramatically since a variable 
can immediately be assigned an unconstrained value from its domain if there is 
such a value. The algorithm for label size maximisation suggested in IWWfi7l 
uses this property and implicitly achieves arc consistency in time linear in the 
number of sites. 

When maximising the number of labeled sites, label sizes are fixed and one 
cannot give up and try a smaller label size as soon as it turns out that there is 
no complete labeling for the current label size. Systems where one cannot expect 
to find a complete solution, i.e. a non-conflicting variable assignment, are called 
over-constrained systems. In such systems one has to be content with imperfect 
solutions. Most effort in the GSP community has been directed to finding solu- 
tions that violate as few constraints as possible fFW92L[T^I7;^I.IFM9fij . When 
labeling maps, such violations would result in label overplots and thus poor leg- 
ibility. It would be possible to take the output of an algorithm which mimimises 
the number of violated constraints and then do some postprocessing. In order 
to get rid of the violations, one could drop a subset of the variables and re- 
sign from labeling the corresponding sites. Unfortunately the problem of finding 
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the smallest subset of variables such that all constraints between the remaining 
variables are satisfied, corresponds to the vertex cover problem and is in itself 
NP-complete. 

A related problem, Max-CSP, has also been investigated. There, one is inter- 
ested in finding a maximum (weighted) subset of the constraints such that there 
is an assignment that satisfies them all. In order to reduce label number max- 
imisation to Max-CSP, one adds a new value A to the domain of each variable. 
A has a unary constraint of low weight; i.e. it only constrains itself. A variable 
which is assigned A then corresponds to an unlabeled site in our setting. The 
bad news is, however, that for general Max-CSP even arc consistency is NP-hard 

mm- 

Therefore we took a different ap- 
proach. We developed a weak form of 
local consistency for our context and 
then wrote a map labeling algorithm 
based on this concept. The algorithm 
first establishes local consistency. Then 
it repeatedly makes a heuristical deci- 
sion and restores local consistency un- 
til each site is either labeled or known 
to constrain too many other sites and 
therefore not labeled at all, see Sec- 
tion 0 

From now on an instance will still 
consist of a set of variables vi, ... ,Vn, 
their domains and binary constraints 
excluding pairs of variable values, but 
for us an optimal solution is a violation- 
free assignment for as many variables 
as possible. We say that the size of an 
optimal solution is the number of vari- 
ables in the assignment. 

We present a set of rules which, 
applied exhaustively, achieve a weak 
form of local consistency. We refer to 
it as weak since we only proof that ap- 
plying these rules does not destroy an optimal solution. It would be interesting 
to see whether they are complete in the sense that after their application no 
domain of a variable can be further reduced without risking to reduce the size of 
the optimal solution if only subsets of 1, 2, or fc variables are taken into account. 
This would correspond to node, arc, and fc-consistency for classical CSP. 

In Figure [D to 0 typical situations before and after the application of a rule 
are depicted. The domain of a variable is represented by a rectangle with round 
corners, the values of a variable are dotted boxes, and the fact that two values of 
different variables exclude each other is marked by a dashed line connecting the 




Fig. 1. applying rule G1 
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Fig. 2. applying rule G2 
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Fig. 3. applying rule G3 
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corresponding boxes. Bold dashed lines mean that the corresponding constraints 
are responsible for the application of the depicted rule. Dashed lines not ending 
in a box indicate that the value from which they are emanating might constrain 
further variables. 

(Gl) If a variable v has two values wi and W 2 , and all values constrained by 
wi are also constrained by W 2 , then set Dy = Dy — {^ 2 }) see Figure [D 

Special case: If a variable v has a value w without constraints, then set 
Dv = {w}- 

(G2) If there is a subset V of variables v\, ... ,vi, each with a value Wi such 
that Wi only constrains variables in V but does not exclude any wj for 
i ^ j, then set Dy. = {wi} for i = 1, . . . , Z. 

Special case: If a variable v has a value w that only constrains a variable 
v', and v' has a value w' which constrains only v and does not exclude w, 
then set Dy = {w} and Dy> = {w'}, see Figure 0 

(G3) If the domain Dy of a variable v consists only of one value w, and the 
values wi, ... ,wi excluded by w belong to different variables vi, ... ,vi 
and pairwise exclude each other (i.e. ii w,wi, . . . ,wi form a clique in the 
constraint graph), then set Dy. = Dy. — {wi} for z = 1, . . . , Z, see FigureEl 

Note that if V is the set of all variables in the instance, then G2 yields a 
complete solution - if there is one. We show that our rules are conservative in 
the following sense. 

Corollary 1. If there is an optimal solution of size k for the given instance 
before applying any of the rules Gl to G3, then there is still a an optimal solution 
of size k after applying one of these rules. 

Proof. Assume to the contrary that the size of the optimal solution decreases 
after we remove a value u from the domain Dy of its variable v. Then every 
optimal solution tt before the elimination must have assigned u to v. Consider 
the circumstances under which u can be removed. 

- There is a value w ^ u oi v which excludes only a subset of the values of u 
(see rule Gl). But then we could replace zt by zc in tt. 

- There is a subset V of variables vi,V 2 , ■ . ■ ,vi, each with a value Wi yf u) 
such that Wi only constrains variables in V but does not exclude any Wj for z yf j 
(see G2). Then we could replace n^Vi) by Wi for z = 1, . . . ,Z without reducing 
the size of tt. 

- The variables v\, . . . ,vi constrained by u each have a value Wi such that 
u,wi, . . . ,wi pairwise exclude each other, and there is a Wj among the wfs which 
does not exclude any other value and which is the only value in the domain of 
its variable, i.e. Dy. = {wj} (see G3). Then we could replace the assignment of 
zz to z; by that of Wj to vj, again without reducing the size of tt. 
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3 Algorithm 

Our algorithm consists of two phases. In phase I, we apply a set of rules to all sites 
in order to label as many of them as possible and to reduce the number of label 
candidates of the others. These rules don’t destroy a possible optimal placement. 
Then, in phase II, we heuristically reduce the number of label candidates of each 
site to at most one. 



Phase I 

In the first phase, we apply all of the following rules to each of the sites. Let pi 
be the candidate label of site p in position i. For each of the rules we supply a 
sketch of a typical situation in the context of point labeling with four rectangular 
label candidates per point. In the pictures, we shaded the candidates which are 
chosen to label their site, and we used dashed edges to mark candidates which 
are eliminated after a rule’s application. 





Fig. 5. Rule L2 



Fig. 6. Rule L3 Fig. 7. Rule L4 



(LI) If p has a candidate pi without any conflicts, declare pi to be part of the 
solution, and eliminate all other candidate labels of p, see Figure E] 

(L2) If p has a candidate pi which is only in conflict with some qk, and q has a 
candidate qj {j yf k) which is only overlapped by pi {I yf i), then add pi 
and qj to the solution and eliminate all other candidates of p and q, see 
Figure 0 

(L3) If p has only one candidate pi left, and the labels overlapping pi form a 
clique, then declare pi to be part of the solution and eliminate all labels 
which overlap pi, see Figure 0 

(L4) If p has two neighbouring label candidates left, say pi and pj, and pi is 
only overlapped by labels which also overlap pj , then eliminate all labels 
which overlap both of them and put pi in the solution, see Figure 0 

Rule L4 can only be applied if pi and pj completely share a vertical (horizon- 
tal) edge, if all labels have the same width (height) and if they are not allowed 
to obstruct a site. Otherwise an optimal solution can be destroyed. Due to these 
restrictions, we have not used rule L4 in our experiments. 
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We want to make sure that the rules are applied exhaustively. Therefore, 
after eliminating a candidate, we check whether they can be applied in its neigh- 
bourhood, i.e. to the site of the eliminated candidate or to the sites of its conflict 
partners. 

Since the rules LI to L4 are restrictions of the more general rules G1 to G3, 
it is clear that they also have the property that if there is a solution of size k 
(i.e. k sites can be labeled) before applying any of the rules, then this is also the 
case after the rule’s application. 

Phase II 

If we have not managed to reduce the number of candidates to at most one per 
site in the first phase, then we must do so in phase II. Since phase II is a heuristic, 
we are no longer able to guarantee optimality. The heuristic is conceptionally 
simple and makes the algorithm work well in practice, see Section^] The intuition 
is to start eliminating troublemakers where we still have a choice. Spoken more 
algorithmically, we go through all sites p which have the maximum number of 
candidates, and delete the candidate with the maximum number of conflicts 
among the candidates of p. This process is repeated until each site has at most 
one candidate left. These candidates then form the solution. 

As in phase II, after eliminating a candidate, we check whether our rules 
can be applied to the site of the deleted candidate or to the sites of its conflict 
partners. 

Analysis 

In order to simplify the analysis of the running time, we assume that the number 
of candidates per site is constant. Then it is easy to see that in phase I, rule LI 
and L2 can be checked in constant time for each site. We use a stack to make sure 
that our rules are applied exhaustively. After we have applied a rule successfully 
and eliminated a candidate, we put all sites in its neighbourhood on the stack 
and apply the rules to these sites. Since a site is only put on the stack if one of 
its candidates was deleted or lost a conflict partner, this part of phase I sums 
up to 0(n + k) time, where n is the number of candidates and k the number of 
pairs of intersecting candidates in the instance, i.e. the number of edges in the 
candidate conflict graph. For rule L3, we have to check whether a candidate is 
intersected by a clique. In general, this takes time quadratic in the number of 
conflict partners. Falling back on geometry, however, can help to cut this down. 
In the case of axis-parallel rectangles for instance, a clique can be detected in 
linear time by testing whether the intersection of all conflicting rectangles is not 
empty. A simple charging argument then yields 0{k^) time for checking L3. 

The time needed for checking L4 is subsumed by that of L3. For both rules, 
checking can be done in constant time if we apply them only to candidates with 
less than a constant number of conflicts. This makes sense since it is not very 
likely that the neighbourhood of a candidate with many conflicts is a clique. In 
this case, phase I can be done in 0{n -|- k) time. 
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In phase II, we can afford to simply go through all sites sequentially and 
check whether they have the current maximum number of candidates. If so, 
we go through the candidates of the current site and determine the one with 
the maximum number of conflicts. The amount of time needed to delete this 
candidate and apply our rules has already been taken into account in phase I. 
Thus phase II needs only linear extra time. 

Putting things together, we get an 0{n + algorithm if rule L3 can be 
checked in linear time, and an 0{n + k) algorithm if we allow only constant effort 
for checking L3 and L4. In our experiments, we have not bounded this effort, 
yet this part of the algorithm showed a linear-time behaviour. Finally, for axis- 
parallel rectangular labels, the conflict graph can be determined in 0(n log n) 
time. 

4 Experiments 

We compare our algorithm to two other algorithms; simulated annealing and a 
greedy method. 

The simulated annealing algorithm we used relies on the experiments per- 
formed by Christensen et al. and follows their suggestions for the initial config- 
uration, the objective function, a method for generating configuration changes, 
and the annealing schedule |CMS95j . In order to save time, we allowed only 30 
instead of the proposed 50 temperature stages in the annealing schedule. This 
did not seem to influence the quality of the results. 

The greedy algorithm picks repeatedly the leftmost label (i.e. the label whose 
right edge is leftmost), and discards all candidates that intersect the chosen label. 
This simple algorithm runs in O(nlogn) time and has an approximation factor 
of 1/{H + 1), where H is the ratio of the greatest and the smallest label height 
!vKSW98j . 

We run our algorithm and those described above on the following instance 
classes. Figures O to El depict an example of each of these classes. 
RandomRect. We choose n points uniformly distributed in a square of size 
25n X 25n. To determine the label size for each site, we choose the length of both 
edges independently under normal distribution, take its absolute value and add 
1 to avoid non-positive values. Finally we multiply both label dimensions by 10. 
DenseRect. Here we try to place as many rectangles as possible on an area of 
size ai^/ri x aiy/n. is a factor chosen such that the number of successfully 
placed rectangles is approximately n, the number of sites asked for. We do this by 
randomly selecting the label size as above and then trying to place the label 50 
times. If we don’t manage, we select a new label size and repeat the procedure. If 
none of 20 different sized labels could be placed, we assume that the area is well 
covered, and stop. For each rectangle we placed successfully, we return its height 
and width and a corner picked at random. It is clear that all points obtained 
this way can be labeled by a rectangle of the given size without overlap. 
RandomMap and DenseMap try to imitate a real map using the same point 
placement methods as RandomRect and DenseRect, but more realistic label 
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sizes. We assume a distribution of 1:5:25 of cities, towns and villages. After ran- 
domly choosing one of these three classes according to the assumed distribution, 
we set the label height to 12, 10 or 8 points accordingly. The length of the label 
text then follows the distribution of the German Railway station names (see 
below). We assume a typewriter font and set the label length to the number of 
characters times the font size times 2/3. The multiplicative factor reflects the 
ratio of character width to height. 

VariableDensity. This example class is used in the experimental paper by 
Christensen et al. |CMS95j . There the points are distributed uniformly on a 
rectangle of size 792 x 612. All labels are of equal size, namely 30 x 7. We 
included this benchmark for reasons of comparability. 

HardGrid. In principle we use the same method as for Dense, that is, trying 
to place as many labels as possible into a given area. In order to do so, we use a 
grid of \_a 2 ^/n\ x [ 0 : 2 cells with edge lengths n. Again, a-i is a factor chosen 
such that the number of successfully placed squares is approximately n. In a 
random order, we try to place a square of edge length n into each of the cells. 
This is done by randomly choosing a point within the cell and putting the lower 
left corner of the square on it. If it overlaps any of the squares placed before, we 
repeat at most 10 times before we turn to the next cell. 

RegularGrid. We use a grid of x squares. For each cell, we randomly 
choose a corner and place a point with a small constant offset near the chosen 
corner. Then we know that we can label all points with square labels of the size 
of a grid cell minus the offset. 

MunichDrillholes. The municipal authorities of Munich provided us with the 
coordinates of roughly 19,400 ground water drillholes within a 10 by 10 kilometer 
square centered approximately on the city center. From these sites, we randomly 
pick a center point and then extract a given number of sites closest to the center 
point according to the Loo-norm. Thus we get a rectangular section of the map. 
Its size depends on the number of points asked for. The drillhole labels are 
abbreviations of fixed length. By scaling the x-coordinates, we make the labels 
into squares and subsequently apply an exact solver for label size maximisation. 
This gives us an instance with a maximal number of conflicts which can just be 
labeled completely. 

In addition to these example classes, we tested the algorithms on the following 
point sets. 

German Railway Stations. We were given the names and coordinates of 373 
German railway stations. Each station was supplied with a priority ranging from 
100 to 5000. The priority does not only refer to the size of a city, but also to 
its importance in the railway network. Stations on the border have a relatively 
high priority, for example. It would be interesting to find a way to modify our 
algorithm such that it takes priorities into account as well - otherwise cities like 
Frankfurt or Stuttgart might not get a label while relatively small towns are 
labeled properly, see Figure ESI 

Berlin Shops. The designer of a tourist map gave us the location and names 
of 357 shops in Berlin offering books, second hand cloths, records, watches. 
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antiquities, toys, jewellery, and art. The data is special in that the labels must 
be rather long to accommodate the shop names and in that it is very densely 
packed, see Figure E3 



Results 

We used examples of 250, 500, . . ., 3000 points. For each of the example classes 
and each of the example sizes, we generated 30 files. Then we labeled the points 
in each file with axis-parallel rectangular labels. We used four label candidates 
per sites, namely those where one of the label’s corners is identical to the site. 
We allowed labels to touch each other but not to obstruct sites. 

The graphs in Figures IHI to ll dI show the performance of the three algorithms. 
The average example size is shown on the x-axis, the average percentage of 
labeled sites is depicted on the y-axis. Note that we varied the scale on the y- 
axis from graph to graph in order to show more details. The worst and the best 
performance of the algorithms are indicated by the lower and upper endpoints 
of the vertical bars. The results of the greedy algorithm are indicated by dotted 
lines and squares, simulated annealing has dashed lines and rhombic markers, 
while our algorithm has solid lines and triangles. 

The example classes are divided into two groups; those that have a com- 
plete labeling and those that have not. For the former group, the percentage 
of labeled points expresses directly the performance ratio of an algorithm. For 
examples of the latter group, which consists of RandomRect, RandomMap and 
VariableDensity, there is only a very weak upper bound for the size of an optimal 
solution, namely the number of labels needed to fill the area of the bounding box 
of the instance completely. Thus for VariableDensity at most 2539 points can 
possibly be labeled. Experiments we performed with an exact solver on examples 
of up to 200 points showed that on an average about 85% of the points in an 
instance of RandomRect and usually less than 80% in the case of RandomMap 
can be labeled. Other than VariableDensity, these classes are designed to keep 
their properties with increasing point number. This is reflected by the fact that 
the algorithms’ performance was nearly constant on these examples. It might be 
worth to note that we used the same set of rules as in phase I of our algorithm 
to speed up the exact solver. 

For all examples, which have a complete labeling, our algorithm labeled be- 
tween 95 and 100% of the points. Experiments on small examples hint that the 
same holds for larger RandomRect and RandomMap examples. The greedy al- 
gorithm performed well given that it makes its decisions only based on local 
information. It was outperformed clearly by our algorithm in all example classes 
but one. On regular grid data, it achieved 100%, followed very closely by the other 
algorithms. For some of the example classes, simulated annealing outperformed 
our algorithm by one to two percent. However, in order to achieve similarly good 
results, simulated annealing needed much longer, in spite of the fact that both 
implementations use the same fast 0(n log n) algorithm for detecting rectangle 
intersections (based on an interval tree). 
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Fig. 8. MunichDrillholes (left) and VariableDensity: point number versus run- 
ning time 



In Figure 0 we present the running times of our implementations in CPU 
seconds on a Sun UltraSparc. We show the two example classes where simulated 
annealing performed most slowly and fastest. Our implementation of the greedy 
algorithm is simply based on lists and uses brute force to find the next leftmost 
label candidate. Given heaps and priority search trees, it would run faster. Our 
implementation of simulated annealing seems to be slower by a factor of 2 to 3 
than that of Christiansen et al. j( lMS95j . This difference in running time may 
be due to the machines on which the times were measured. 

Conclusion 

We have presented a simple and fast heuristic for a very general version of the 
labeling problem. Due to this generality, we could not expect to achieve any 
approximation guarantee as algorithms focussing on special label shapes. Still, 
our technique works very well in practice. The results are similar to those of 
simulated annealing, but obtained much faster. Compared to the approach in 
mm , our main emphasize was on a set of rules. It would be interesting to see 
whether it was worth to integrate the time costly matching step suggested there 
into our algorithm. 
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Fig. 11. DenseMap 



Fig. 12. DenseRect 




Fig. 13. HardGrid Fig. 14. RegularGrid 




Fig. 15. MunichDrillholes 



Fig. 16. VariableDensity 
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Fig. 17. RandomMap: 250 points, 193 lab. Fig. 18. RandomRect: 250 points, 212 lab. 





Fig. 19. DenseMap: 253 points, 249 la- Fig. 20. DenseRect: 261 points, 258 la- 
beled beled 





Fig. 21. HardGrid: 253 points, 252 lab. Fig. 22. RegularGrid: 240 points labeled 
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Fig. 23. MunichDrillholes: 250 points lab. Fig. 24. VariableDensity: 250 points lab. 
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Fig. 25. left: 357 tourist shops in Berlin, right: 45 of 63 labeled. 
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Fig. 26. 373 German railway stations, 270 labeled. 
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Abstract. In this paper we present an algorithm for 3-dimensional or- 
thogonal graph drawing based on the movement of vertices from an initial 
layout along the main diagonal of a cube. For an n-vertex m-edge graph 
with maximum degree six, the algorithm produces drawings with bound- 
ing box volume at most 2.37n® and with a total of 7m/3 bends, using 
no more than 4 bends per edge route. For maximum degree five graphs 
the bounding box has volume n® and each edge route has two bends. 
These results establish new bounds for 3-dimensional orthogonal graph 
drawing algorithms and improve on some existing bounds. 



1 Introduction 

Prompted by advances in graphics workstations and applications including VLSI 
circuit design [HEDIE2I and software engineering pa EH, there has been recent 
interest in graph visualisation in 3-dimensional space. Proposed models include 
straight-line drawings prranT). convex drawings pisi, spline curve drawings 
m, multilevel drawings of clustered graphs visibility representations mEH 
and of interest in this paper orthogonal grid drawings p F71 rmirrn rrp rra CT 

E31ES1- 

The 3-dimensional orthogonal grid consists of grid points in 3-dimensional 
space with integer coordinates, together with the axis-parallel grid lines deter- 
mined by these points. An orthogonal grid drawing of a graph places the vertices 
at grid points and routes the edges along sequences of contiguous segments of 
grid lines. Edge routes are allowed to contain bends and can only intersect at a 
common vertex. 

We shall refer to the 3-dimensional orthogonal grid as the grid and an or- 
thogonal grid drawing with no more than b bends per edge as simply a b-bend 
grid drawing. At a vertex v the six directions, or ports, the edges incident with v 
can use are denoted A+, X~, Vy+, Y~, and Z~ . Clearly, grid drawings can 
only exist for graphs with maximum degree six. Figure^shows an example of a 
2-bend grid drawing. 

The grid has been extended to higher dimensions 1231, and by representing 
a vertex by a cube for example, 3-dimensional grid drawing of arbitrary degree 
graphs has also been considered El ^ • 
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Fig. 1. A 2-bend grid drawing of K^. 



The most common proposed measures for determining a ‘good’ grid drawing 
have been the bounding box volume and the maximum number of bends per 
edge route. Using straightforward extensions of the corresponding 2-dimensional 
NP-hardness results, optimising any of these criteria is NP-hard 0. In this paper 
we look at the total number of bends as a further measure of the ‘goodness’ of 
a grid drawing. 

Table □ shows the tradeoff between the bounding box volume and the maxi- 
mum number of bends per edge apparent in the existing grid drawing algorithms. 
The algorithms of Eades, Symvonis and Whitesides im are denoted ESWl-4. 
(Their compact (ESWl) and 3-bends (ESW4) algorithms originally appeared in 
HH-) The linear time incremental algorithm of Papakostas and Tollis m is de- 
noted PT. DW refers to the algorithm presented in this paper. Also of interest is 
the Reduce Forks algorithm of Patrignani and Vargiu m which in experimental 
tests averaged less than 2.5 bends per edge route with bounding box volume 
0 . 6 ^ 3 . 



Table 1. Upper Bounds of Grid Drawing Algorithms 





ESWl 


ESW2 


ESW3 


DW 


PT 


ESW4 


DW (A < 5) 


volume 


0(n3f2) 


0(n3) 


0(n®/3) 


2.37n3 4.66 u3 


27n^ 


n® 


max bends 


7 


6 


5 


4 


3 


3 


2 



Algorithm ESWl requires the least volume at the cost of more bends per 
edge route. In fact, this volume bound is tight. Kolmogorov and Barzdin P! 
and Rosenberg (221 establish a lower bound of I7(n3/^) for the bounding box 
volume. Algorithms ESW4 and PT establish an upper bound of three for the 
maximum number of bends per edge route. Eades, Symvonis and Whitesides 
Eum] had conjectured that there does not exist a 2-bend grid drawing of Ky. 
Wood f23j presents a counterexample to this conjecture. 2-bend grid drawings 
of the other multi-partite 6-regular graphs Kq^, and K 2 , 2 , 2,2 have also 

been found PI- While it is unknown if the upper bound of 3 for the maximum 
number of bends per edge route is tight, it is easily seen that provides a lower 
bound of two m- The following open problem is therefore of interest: 
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2-Bends Problem: Does every maximum degree six graph admit a 2-bend grid 
drawing? 

In this paper we solve the 2-bends problem for maximum degree five graphs. 
Our algorithm, applied to an n- vertex m-edge graph with maximum degree six, 
produces a 2.37n^-volume 4-bend grid drawing with at most 7m/3 bends. The 
algorithm positions the vertices along the diagonal of a cube according to an 
‘approximately balanced’ ordering. From there vertices are moved in up to two 
dimensions and non-intersecting edge routes determined. 

Section 2 of this paper introduces balanced orderings. Section 3 describes a 
model for grid drawing, the types of edge routes used and how to avoid edge 
route intersections. Following the presentation of the algorithm in Section 4, we 
conclude by comparing its performance with the existing algorithms. 

Throughout this paper G is an n-vertex m-edge undirected simple graph 
with maximum degree six. We define the directed graph G' with vertex set 
V{G') = V{G) and two arcs (v,w), (u>,u) G Tl(G') for each edge {v,w} G E{G). 
The arc {v, w) is called the reversal of (u>, v). We use the notation vw to represent 
the edge {■(;, w}, the directed arc (f, w) or the edge route for {u, w}. The port at 
V used by an edge route vw is referred to as the port assigned to the arc vw. A 
total ordering < of V{G) induces a numbering (ui,f 2 , . . • ,Un) of V{G) and vice 
versa. We shall refer to both < and {v\,V 2 , ■ . ■ , Vn) as an ordering of V (G). 



2 Balanced Orderings 

Given an ordering < on R (G), if vw G E(G) with v < w we say r; is a predecessor 
of w and w is a successor of v; vw is a successor arc of v and wv is a predecessor 
arc of w. The number of predecessors and successors of a vertex v are denoted 
Pv and Sy respectively, v has cost Cy = \sy — py\. Note that a vertex has even 
cost iff it has even degree. The total cost of G with respect to a given ordering is 
the sum of the cost of each vertex. If a = min(s„,p„) and b = max(s„,p„) then 

V is said to be an (a, 6)-vertex. 

The 3-bends algorithm of Fades, Symvonis and Whitesides cniini positions 
the vertices along the diagonal of a cube according to an arbitrary ordering. 
Under this model a 2-bend grid drawing is possible iff each vertex v has Pv < S 
and Si, < 3 1241 . in which case we say v is balanced. 

We say v is positive if Sy > py and negative if py > Sy. Clearly, if Sy = py then 

V is balanced. For positive and balanced vertices v and for k > 0 (respectively, 
k < 0) v^ denotes the k^^ successor (predecessor) of v to the right (left) of v 
in the ordering. For negative v and for fc > 0 (respectively k < 0) v^ denotes 
the k^^ predecessor (successor) of v to the left (right) of v in the ordering. Two 
adjacent vertices v,w with v <w are opposite if v is positive and w is negative. 

Given an arbitrary ordering of V (G) we apply the following three rules of 
movement to each pair of opposite vertices v, w. Moving an unbalanced vertex 

V past v^ for some i {1 < i < [c„/2j) reduces c„ by 2i. It follows that each rule, 
when executed, reduces the total cost of the ordering. 
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Ml If w = ti* for some i (1 < i < [c„/2j) then move v to immediately past v'‘, 
as in Figure El 




Fig. 2. The move Ml for a (l,5)-vertex v and a (2,4)-vertex w with i = 2. 



M2 If V < < w for some i,j (1 < i < \cy/2\, 1 < j < [cu,/2j), then 

move V up to u® and move w up to , as in Figure 0 




Fig. 3. The move M2 for a (l,5)-vertex v and a (2,4)-vertex w with i = 2 and 



M3 If V < v'' = < w for some i,j (1 < i < [(c„ — 1)/2J, 1 < j < 

L(cii) ~ 1)/2J) then move v to immediately past u* and move w to imme- 
diately past , as in Figure 0 . 




Applying Ml, M2 or M3 reduces c„ by at least 2i and for each fc (1 < A: < i—f) 
Cyk is increased by at most two. For M2 and M3, Cw is reduced by at least 2j 
and for each fc (1 < A: < j — 1) c^k is increased by at most two. The cost of all 
other vertices remains unchanged. Thus for Ml the total cost decreases by at 
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least two, and by at least four for M2 and M3. An ordering of V{G) is said to 
be approximately balanced if the rules Ml, M2 and M3 cannot be applied. 

3 The ‘Unique Coordinates’ Model 

We now present a model for grid drawing where each vertex has a unique X, a 
unique Y and a unique Z coordinate. Each edge route has at least two bends, 
and a 2-bend edge route must have three perpendicular segments, as in Figured! 




We shall also use particular types of 3- and 4-bend edge routes. A 3-bend 
edge route vw, said to be anchored at v, consists of a unit length segment from 
V followed by a 2-bend edge route to ic, as in Figure E] 




Fig. 6. 3-bend edge routes vw anchored at v 



A 4-bend edge route vw, said to be anchored at v and at w, consists of 
unit length segments at v and at w with a 2-bend edge route in between, as in 
Figure H 

We represent the relative coordinates of the vertices by three orderings <jc, 
<y and <2 of V{G), denoted the X-, Y- and Z-orderings. Port assignments 
are represented by a 3-colouring of A{G') using colours {X,Y,Z}. An arc vw 
coloured / S {X, Y, Z} uses the /+ or I~ port. We also maintain a set of anchored 
arcs of G' (to be specified in Section EJ , such that an arc vw is anchored iff the 
edge route vw is anchored at v. The ports at v and w used by a 2-bend edge route 
vw must be perpendicular and point towards w and v respectively. Therefore the 
3-colouring of A{G') must satisfy: 
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Fig. 7. 4-bend edge routes vw anchored at v and at w 



— If neither an arc vw nor its reversal arc wv are anchored then they are 

coloured differently. (1) 

— For each vertex v, for each / G {X,V,Z}, there are at most two outgoing arcs 

vu, vw coloured I, and if neither vu nor vw are anchored then u <i v <i w 
or w <i V <i u. (2) 



Theorem 1. Let G be a graph. Suppose there exists X-, Y- and Z-orderings of 
V{G), a set of k anchored arcs of G' , and a 3-colouring of A{G') satisfying (1) 
and (2). Then there exists an {n-\- k/3)^-volume 4-bend grid drawing of G with 
at most 2m-\-k bends. Furthermore if k = 0 then all edge routes have two bends. 

Proof: Initially position each vertex v at (vx,vy,vz) where vj is the index 
of V in the /-ordering. Suppose the arc vw G A{G') is coloured I G {X,Y,Z} 
and its reversal arc wv is coloured J G {X,Y,Z}. Let vu be the other outgoing 
arc at v (if any) also coloured I. 

If vw is not anchored then assign vw the port /+ if v </ w and /“ if w </ v. 
If vu is also not anchored then by (2) vu will be assigned the opposite port to 
vw. If neither vw nor wv are anchored then by (1) I ^ J, and we route vw with 
two bends. 

Suppose vw is anchored. If vu is not anchored then assign vw the opposite 
port to vu, otherwise the ports Iff and Iff can be arbitrarily assigned to vu 
and vw. If wv is not anchored then route the edge vw with a 3-bend edge route 
anchored at v. If vw and wv are both anchored then route vw with a 4-bend 
edge route. 

For each anchored arc vw coloured / insert a plane at v perpendicular to the 
/-axis so that the unit length segment of the edge route vw lies between v and 
the inserted plane. The plane is considered to have an /-coordinate unique to v. 

A grid point on an edge route vw has two coordinates unique to v, one 
coordinate unique with v and one with w, or two coordinates unique to w. 
Therefore edge routes can only intersect if they are incident at a common vertex, 
and these edge routes, say vu and vw, must intersect as in Figure 0 

In each case, swapping the ports at v assigned to vu and vw reroutes the edges 
so that they no longer intersect. In (b) and (a) (if exactly one arc is anchored) 



338 



David R. Wood 




(a) vu, vw possibly anchored 




(c) vw anchored, vu possibly anchored 




(d) vu, vw both anchored 



Fig. 8. Rerouting intersecting edge routes 

the anchor is transfered to the other arc. In (a) and (c) swapping ports may 
create new edge route intersections between uv and another edge incident at u, 
or similarly at w. However in (a) the sum of the lengths of the middle segments 
of vu and vw is reduced (see the highlighted segments), and in (c) (and also in 
(d)) at least one anchored arc (and thus a bend) is eliminated. Since this sum 
and the number of bends is bounded below, by applying (a), (c) and (d) before 
(b) a finite number of swaps suffice for an intersection-free grid drawing. 

The bounding box has volume (n -|- kx) x (n -I- ky) x (n -I- kz), where kj is 
the number of anchored arcs coloured I. It is easily seen that the bounding box 
volume is maximised when it is a cube; i.e. kx = ky = kz = k/3. Therefore the 
bounding box volume is at most (n + k/3)^. An unanchored edge route has two 
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bends, and each anchored arc contributes one further bend. Therefore the total 
number of bends is 2m+k, and if A: = 0 then only 2-bend edge routes are used. I 

The algorithm described in the next section shall apply Theorem 1 with one 
anchored arc for each degree six vertex. 



4 The Algorithm 

Our grid drawing algorithm initially positions the vertices along the main diag- 
onal of a cube according to an approximately balanced ordering. At a balanced 
vertex v the positive (respectively, negative) ports are assigned to the successor 
(predecessor) arcs of v. However, at an unbalanced, say positive, vertex v the 
positive ports can be assigned to at most three successor arcs of v. The remaining 
successor arcs vw must be assigned a negative port at v. To do so we can anchor 
the arc vw (as referred to in Section ot, or move v past w in some ordering, in 
which case vw is said to be a movement arc. Table El defines the movement and 
anchored arcs for each type of unbalanced vertex. 

Table 2. Definition of movement and anchored arcs for unbalanced vertices 



V 


(0,4) 


(1,4) 


(0,5) 


(2,4) 


(1,5) 


(0,6) 


vv^ 


movement 


movement 


movement 


anchored 


movement 


movement 


vv^ 


- 


- 


movement 


- 


anchored 


movement 


vv^ 


- 


- 


- 


- 


- 


anchored 



If vw is a movement arc coloured I then v is moved to immediately past w 
in the /-ordering, thus allowing vw to be assigned the I~ port for positive v and 
the port for negative v. In Figure we illustrate the movement and anchoring 
process in the case of a positive (0,6)-vertex. 

For an unbalanced vertex v, if vw = vv'^ is a movement or anchored arc 
then i < [c„/2j , so rule Ml is applicable. Therefore w cannot be opposite to v, 
and hence wv cannot also be a movement or anchored arc. Consequently when 
edges are routed no 4-bend edge routes are immediately constructed. It is only 
through swapping ports to avoid intersections that a 4-bend edge route can be 
introduced. Furthermore, if uu* is a movement arc then i < [(c„ — 1)/2J, so by 
rules M2 and M3, if v and w are opposite unbalanced vertices then the movement 
arcs of V do not ‘cross over’ or have the same destination vertex as the movement 
arcs of w. 

To represent the 3-colouring of A{G'), we construct a graph G” with vertex 
set V{G") = A{G'). Vertices are adjacent in G" if the corresponding arcs must 
use non-parallel ports. We distinguish four types of edges of G": 

1. The first type of edge ensures that arcs which ‘compete’ for the same ports 
are coloured differently. In Table 01 we define the arcs 
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Fig. 9. u is a positive (0,6)-vertex, vv^ is a movement arc coloured X, vv^ is a 
movement arc coloured Y, vv^ is an anchored arc coloured Z; move v to v'. 

vv^ and vv^ for each type of vertex. If r; is a balanced or a positive (respec- 
tively, negative) unbalanced vertex then vv^, vv^ and vv^ will be assigned 
the negative (positive) ports at v. The arcs vv^ , vv^ and vv^ will be as- 
signed the positive (negative) ports at v. Thus vv^, vv^ and vv^ are pairwise 
adjacent in G” , and vv^ , vv^ and vv^ are pairwise adjacent in G” . 



Table 3. Definition of vv^, vv^ , vv'^ , vv^ , vv^ and vv^ 



V 


ARC' 

vv vv vv 


vv^ vv^ vv^ 


balanced 


vv~^ vv~^ vv~^ 


VV^ VtP' vv^ 


(0,4)-vertex 


vv^ 


VtP' vv^ vv'^ 


(l,4)-vertex 


vv~^ vv^ 


VtP' vv^ vv^ 


(2,4)-vertex 


vv~^ vv~^ vv^ 


vv^ vv^ vv'^ 


(0,5)-vertex 


vv^ vv^ 


vv^ vv^ vv^ 


(l,5)-vertex 


vv~^ vv^ vv^ 


vv^ vv^ vv^ 


(0,6)-vertex 


vv^ vvi^ vv^ 


vv^ w® w® 



2. If neither the arc vw not its reversal arc wv are anchored then add the edge 
{vw,wv} (labelled ‘r’) to G”. 
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3. If vw and wx are both movement arcs for some vertices v, w and x, then 
add the edge {vw,wx} (labelled ‘*’) to G" . This ensures that v and w do 
not move in the same ordering. 

4. If vv'^ is a movement arc coloured I then v will move past in the I- 
ordering. To ensure that v^v does not use the incorrect lyi port add the 
edge {vv‘^,v^v} (labelled ‘**’) to G” . Note that in Figure 0 v^v cannot use 
the port J^i. 

We now summarise our algorithm. 

1. Determine an approximately balanced ordering {v\,V 2 , ■ ■ ■ ^ Vn) of V (G). 

2. Initialise the X-, Y- and Z-orderings to be {vi,V 2 , ■ . • , Vn)- 

3. Construct and 3-colour the graph G" with colours {X,Y,Z}. 

4. For each movement arc vw coloured / G {X,Y, Z}, move v to immediately 
past w in the /-ordering. 

5. Position each vertex v at (vx,vy,vz)- 

6. For each anchored arc vw coloured I, insert a plane at v perpendicular to 
the /-axis. 

7. Route the edges and remove edge route intersections. 



Theorem 2. For a simple graph G with maximum degree six, the above algo- 
rithm will determine a ^-bend grid drawing ofG with bounding box volume 2.37n^ 
using at most Imj?) bends. If G has maximum degree five then the bounding box 
has volume and each edge route has two bends. 

Proof: To prove that the graph G" is 3-colourable we employ two operations 
which preserve the 3-colourability of a graph. Firstly, a degree one or two vertex 
V and its incident edges can be removed; v can be later coloured with the colour 
different from its neighbours. Secondly, if K 4 \ {r;,w} is a subgraph for some 
non-adjacent vertices v and w, then in any 3-colouring v and w must receive 
the same colour, so we merge these vertices and replace any multiple edges by a 
single edge. We shall now show that this process can be continued until G" has 
maximum degree three, and is not K 4 , so by Brooks’ Theorem 0 is 3-colourable. 

For an unbalanced vertex v, let //„ be the subgraph of G" consisting of the 
vertices vv^, vv^ and vv’^ and their incident edges. We shall initially show that 
Hy ‘reduces’ to a maximum degree three subgraph. 

For a degree six unbalanced vertex v, the vertex of G" corresponding to the 
anchored arc vv’^ is incident with at most two (unlabelled) edges, and therefore 
can be removed from G” . Since a (0,6)-vertex and a (0,5)-vertex v both have vv^ 
and vv^ as movement arcs, Hy is the same for a (0,6)-vertex v (after removing 
vv'^) and for a (0,5)-vertex v (see Figures CH and EJ- Similarly, for (1,5)- and 
(2,4)-vertices, Hy is the same as for (1,4)- and (2,3)- vertices respectively. We 
therefore need only consider (0,5)-, (1,4)- or (0,4)- unbalanced vertices. 
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** 


* 


[v^vj 


l vv 1 


/ / 


/ — 




1“ — “ 






1 1??; , 


1 1 \ vv ■ J 


i_ 




Fig. 10. The subgraph Hy for a (0,5)-vertexor a (0,6)-vertex u with balanced. 



Consider a (0,5)-vertex v. may be balanced or a (l,4)-vertex. If is 
balanced then, as in Figure II 1)1 vv^ has degree two and can be removed. In the 
remaining graph vv'^ and v^v have degree three. 

Now, if is a (l,4)-vertex then, as in Figure ITTl vv'^ and are the 

non-adjacent vertices in a 1^4 \ {e} subgraph. If we merge these vertices then v^v 
and vv^ have degree two and can be removed. If is balanced then there is no 
edge (labelled If is unbalanced then v'^ must be a (1,4)- 

vertex, and therefore v'^v and the edge {vv^^v^v} (labelled ‘r’) will be removed 
(see FigureEJ- In either case vv"^ (=u^(u^)^) has degree three. 





Fig. 11. The subgraph Hy for a (0,5)-vertex or a (0,6)-vertex v with a (1,4)- 
vertex. 



Consider a (l,4)-vertex v and assume that v~^ is not a (0,5)-vertex with 
(ti“i)i = V (we have already considered this case). As in Figure El the vertex 
vv~^ has degree two and can be removed, vv^ now has degree at most three. For 
a (0,4)-vertex v, Hy simply consists of the degree one vertex vv^, which can be 
removed. 




Fig. 12. The subgraph Hy of G” for a (l,4)-vertex or a (l,5)-vertex v. 
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Consider a vertex nw® £ V{G") for some i £ {D,E,F} or f £ {A,B,C} if 
V is balanced, vv'^ is incident with at most two unlabelled edges and to at most 
one edge labelled ‘r’. Unless is a (0,5)- or (0,6)-vertex and = v (in which 
case vv^ is incident with an edge labelled and has already been considered), 
vv'^ has degree at most three. 

We have shown that all remaining vertices in G" have degree at most three, 
and it is easily seen that G" is not iT 4 , by Brooks’ Theorem G" is 3- 
colourable. We shall now show that this 3-colouring satisfies the conditions of 
Theorem 1. 

The edges of G” labelled ‘r’ guarantee condition (1) holds. The unlabelled 
edges in G" ensure that at most two outgoing arcs at each vertex v receive the 
same colour. Suppose the arcs vu £ and vw £ {vv^ ,vv^ ,vv^} 

are coloured I £ {X, Y, Z}, and vu and vw are both not anchored. 

If vu is not a movement arc then v is between u and w in the initial ordering, 
and V does not move past u or w in any ordering. If u moves past v then it 
does so in the same ordering as the colour assigned to the movement arc uv. 
Since non-anchored reversal arcs are coloured differently uv is not coloured /, 
so u does not move in the /-ordering. Otherwise, if vu is a movement arc then 
u is between v and w in the initial ordering. In the /-ordering v moves past u 
and, since the movement arcs of w (if any) do not cross over or have the same 
destination vertex as vu, w cannot move past u in any ordering. Therefore v is 
between u and w in the /-ordering, and condition (2) holds. 

We have thus shown that Theorem 1 is applicable. If k is the number of 
unbalanced degree six vertices (and therefore the number of anchored arcs) then 
Theorem 1 asserts G has a 4-bend grid drawing with bounding box volume 
(n -|- k/'Yf and 2m + k bends. Since k < n the bounding box volume is at most 
(n -I- n/3)^ = (4n/3)^ < 2.37n^. If d is the average degree of those vertices 
without anchored outgoing arcs then 6fc -|- d{n — k) = 2m and the number of 
bends is 2m -I- (2m — d{n — k))/6 = 7m/3 — d{n — k)/6. Since n > k the grid 
drawing has at most 7mj3 total bends. 

For maximum degree five graphs, no anchored arcs are introduced. By The- 
orem 1 the bounding box has volume and each edge route has two bends. 



5 Experimental Results 

In Table 0we compare the performance of existing grid drawing algorithms with 
the algorithm presented in this paper. For the 3-bends (ESW4) algorithm fDI 
CD, the Papakostas and Tollis (PT) algorithm C3 ^^e Reduce Forks (RF) 
algorithm P2|, we have used the implementations incorporated in the 3DCube 
system of Patrignani and Vargiu m- The Kneser graph consists of the 
6-subsets of {1, 2, . . . , a} whose intersection has cardinality c. 

In most cases the algorithm presented in this paper outperforms the other 
algorithms both in terms of bounding box volume and the total number of bends. 
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Table 4. Performance of 3D orthogonal grid drawing algorithms 









Avg. 


(Max.) bends 


per 


edge 




Bounding box volume /n® 


graph 


n 


m 


FSW4 


PT 


RF 


DW 


FSW4 


PT 


RF 


DW 


Ke. 


6 


18 


2.60 


(3) 


2.40 


(3) 


0.80 


(2) 


2.00 


(2) 


20.15 


1.56 


0.06 


0.58 


Kr 


7 


21 


2.57 


(3) 


2.24 


(3) 


1.71 


(6) 


2.29 


(3) 


23.32 


1.31 


0.73 


1.49 


7^2, 2, 2, 2 


8 


24 


2.75 


(3) 


2.29 


(3) 


2.25 


(8) 


2.25 


(3) 


23.76 


1.58 


1.96 


1.42 


7^3,3, 3 


9 


27 


2.74 


(3) 


2.26 


(3) 


2.78 


(8) 


2.22 


(3) 


24.11 


1.51 


3.16 


1.37 


7f6,6 


12 


36 


2.72 


(3) 


2.31 


(3) 


3.67 


(9) 


2.17 


(3) 


24.81 


1.79 


3.75 


1.27 


7^1,1 


10 


30 


2.67 


(3) 


2.30 


(3) 


1.67 


(7) 


2.27 


(4) 


24.39 


1.72 


0.62 


1.58 


Klo 


15 


45 


2.58 


(3) 


2.31 


(3) 


2.87 


(7) 


2.24 


(4) 


25.24 


1.85 


2.01 


1.62 


Kl^o 


462 


1386 


- 




2.21 


(3) 


- 




2.14 


(3) 


- 


1.77 


- 


1.47 
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Abstract. This paper proposes a multidrawing approach to graph draw- 
ing. Current graph-drawing systems typically produce only one drawing 
of a graph. By contrast, the multidrawing approach calls for systemati- 
cally producing many drawings of the same graph, where the drawings 
presented to the user represent a balance between aesthetics and diver- 
sity. This addresses a fundamental problem in graph drawing, namely, 
how to avoid requiring the user to specify formally and precisely all the 
characteristics of a single “nice” drawing. We present a proof-of-concept 
implementation with which we produce diverse selections of symmetric- 
looking drawings for small graphs. 



1 Introduction 

Imagine you have a graph and you want a nice drawing of it. You don’t know 
what a nice drawing for this graph looks like, but you think you can recognize one 
when you see it. What do you do? First, you try a known graph-drawing method. 
The drawing it returns is not ideal, so you modify the system’s constraints or 
parameters or random-number seed in the hope of producing a drawing you like 
better. This typically results in a haphazard and tedious exploration of drawings 
which may or may not result in one that you like. 

You might prefer instead to look at an organized selection of drawings that 
were chosen to show the diversity of drawings possible, subject perhaps to very 
general aesthetic guidelines that you supply. Then you could pick the ones that 
you like best, and maybe even ask the computer for more drawings similar to 
those. In this way, you and the computer would be collaborating in a system- 
atic way to learn what you mean by “nice” for this graph, and to produce one 
or several suitable drawings. The main result of this paper is to introduce (in 

* This research was supported in part by funding from NSERC and FCAR. 
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Section 0 the multidrawing approach as a realization of this idealized graph- 
drawing system, and to present a simple proof-of-concept implementation (Sec- 
tion Our implemented system is called smile (for Symmetric Multidrawing 
Layout Experiment), and it generates a diverse selection of symmetric-looking 
drawings for a given small input graph. Section 0 gives several directions that 
future experimentation and research might take, and Section concludes. 

2 Graph Multidrawing 

Graph multidrawing is best explained operationally. The canonical multidrawing 
system has four principal subsystemsu 

Layout: The layout subsystem must be capable of producing a wide variety of 
drawings of the same graph, either by exploiting randomization or by varying 
the input parameters. Fortunately, many drawing algorithms have this capability 
inherently (e.g., those based on spring simulation), and many other algorithms 
can be modified to have it. 

Since many layouts of the same graph are to be generated, either the graph 
should be small or the layout subsystem should be fast. 

Dispersion: The dispersion subsystem seeks to produce a selection of aesthet- 
ically pleasing drawings that cover the space of possible drawings of a given 
graph. These twin considerations of aesthetics and diversity typically require 
compromise: optimizing with respect to aesthetic criteria (e.g., number of edge 
crossings, degree of symmetry, distribution of edge lengths, etc.) usually implies 
a small number of optimal drawings, yet diversity implies a large number of 
different drawings. Thus, a good dispersion heuristic should consider aesthetics, 
but not optimize with regard to them. 

Since an effective dispersion heuristic must also achieve diversity, a way to 
quantify diversity is needed. This in turn requires a way to measure the (dis)simi- 
larity of two drawings. The emergence of measuring drawing similarity as an 
important concept is thus one of the interesting consequences of the multidrawing 
approach. 

Presentation: How to present as many as several hundred drawings computed by 
the dispersion subsystem is an important practical issue. An organized display, 
in which similar drawings are grouped together so that they can be browsed 
in a systematic fashion, is obviously preferable to an arbitrary arrangement of 
drawings. 

Feedback: In any diverse selection of graph drawings, some will be preferred over 
others. The ideal feedback subsystem would give the user an easy way to request 
a further selection of “good” drawings, with fewer “bad” drawings included. 

^ The progenitor of the multidrawing approach is the Design Gallery''^“ project of 
Marks et al. |J], in which a similar approach is taken to a variety of computer- 
graphics and animation production tasks. 
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The next section describes our smile multidrawing system, which has complete 
layout, dispersion and presentation subsystems, but (currently) no feedback ca- 
pabilities. 



3 Smile: A Proof-of-Concept Implementation 

As a proof-of-concept, we have implemented the smile multidrawing system, 
which generates a diverse selection of symmetric-looking drawings for a given 
input graph. 

We chose to experiment with the symmetry aesthetic for two reasons. First, 
it has been hypothesized to lead to attractive drawings |S|, and yet it has been 
found relatively unhelpful for certain common graph-comprehension tasks nm. 
Our hope was that multidrawing might contribute to the debate regarding sym- 
metry in graph drawing. Secondly, we had available the glide system CH to 
use as a layout subsystem. Glide uses a spring-based layout algorithm that can 
be readily tailored to foster symmetric-looking layouts. 



3.1 What SMILE Does 

Smile takes a graph as input and produces as output a large number (128 
for the examples shown below) of different drawings of it. How smile works is 
detailed in the next subsection. Here we describe the interface to the presentation 
subsystem, and show a sampling of the 128 different drawings that the system 
produced for each of a few well-known graphs. 

Figure ^ shows smile’s browser interface, with which the user can inspect 
the computed set of drawings for a given input graph. This interface is similar to 
those described in detail in |5| and using a multidimensional-scaling layout 
method, similar drawings are located near each other in the main window; this 
window can be panned and zoomed interactively and interesting drawings can 
be viewed in their own individual pop-up windows. 

Figure El shows several drawings of graph Kq computed by SMILE. Although 
Kq is a relatively simple graph, the drawings illustrate well the complicated 
interplay between aesthetics and diversity that makes graph multidrawing an 
interesting idea. For example, this selection shows that there is more to making 
pleasing drawings than just symmetry. It is clear from inspection that edge 
crossings, edge lengths, and vertex and edge gestalts (perceptual grouping) all 
play important if ill-defined and subjective roles in the way humans perceive 
drawings. 

We have also experimented with the Ala , 5 and Petersen graphs. Although only 
slightly larger and less uniform in structure than Kq, the variety of drawings for 
these graphs produced by smile is considerably greater, as shown in Figures El 
and El 
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Fig. 3. 




Several different drawings of computed by SMILE. 













Fig. 4. Several different drawings of the Petersen graph computed by smile. 



3.2 How SMILE Does It 

Now we describe how smile’s layout and dispersion subsystems work.0 

Layout: The layout subsystem was derived from the glide system El Given 
an arbitrary initial configuration of a graph’s vertices, it attempts to produce 
a straight-line drawing in which the vertices and edges are symmetric about a 
vertical axis, or about both vertical and horizontal axes. Given different initial 
configurations of vertices, it will produce different drawings. Of course it cannot 
achieve a perfectly symmetric-looking drawing if the initial input graph has no 
symmetry, and it also sometimes fails to produce a symmetric-looking drawing 
even when one is possible. In both such cases, the typical result is a drawing that 
looks fairly symmetric, with perhaps one or two asymmetric vertices or edges. 



^ The elements of the presentation subsystem are described in [2| and [J]. 
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The underlying layout algorithm uses a generalized notion of spring force to 
move vertices from their initial to final positions. The spring forces are of two 
kinds: one kind discourages vertex- vertex and vertex-edge overlaps in an obvi- 
ous fashion. The other kind encourages symmetry explicitly, unlike the implicit 
propensity towards symmetry that arises in more conventional spring-based sys- 
tems 0. The layout algorithm differs from the original glide algorithm in two 
small, but significant ways. First, the computation of symmetry-inducing forces 
in the original algorithm requires that each vertex be matched with another ver- 
tex or with itself; in the current algorithm, vertex matchings that promote edge 
symmetry are favored over those that do not. Second, the spring- force simula- 
tion is run to quiescence once without vertex-edge forces in effect, and then once 
again with them in effect. This aids greatly in avoiding unfortunate local energy 
minima of the simulated physical system. 

Dispersion: One way to achieve dispersion is to distill a large number of ran- 
domly generated drawings down to a small, diverse set; an alternative method 
is to refine repeatedly a current set of drawings so as to diversify the set. Both 
methods were tried by Marks et al. [Z], with the latter current-set method ex- 
hibiting markedly superior performance. However, this earlier work sought only 
to achieve pure diversity, and did not balance diversity with aesthetics, as called 
for by our multidrawing application. 

The dispersion heuristic used by smile extends the previous current-set 
method by considering both diversity and aesthetics (i.e., symmetry) in the 
set of drawings. Each member of the current set consists of both an initial ver- 
tex configuration that gets passed to the layout subsystem, and the resulting 
drawing that gets returned. A new candidate member can then be generated by 
randomly selecting an existing member of the current-set, perturbing its initial 
vertex configuration, and computing a new resulting drawing. The dispersion 
subsystem works by repeatedly generating candidate members, and substituting 
them for existing members whenever the substitution will improve the current 
set. In our case, improvement is quantified by a combination of higher symmetry 
scores and greater nearest-neighbor distances between drawings in the current 
set0 

Symmetry scores are straightforward: they are a weighted sum of the number 
of vertices and edges that are drawn symmetrically with respect to the chosen 
symmetry axis or axes. Given the capabilities of our layout subsystem, many 
drawings will have the same symmetry scores. 

Devising a useful distance or similarity measure between drawings is much 
harder. We experimented with several measures before arriving at the following 
one: We greedily match compatible vertices that are at similar places in the 
two drawings, and sum up a measure of their distance. More precisely, for each 

® For each graph it processed, smile used a current set of 128 members, and considered 
5,000 candidate members. This takes from 19 minutes (for Ke) up to 95 minutes (for 
the Petersen graph) on a MIPS RIOOOO processor. See jZ] for more details about the 
dispersion process. 
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vertex u in drawing Di we find the unmatched vertex v in D2 such that u and 
V have the same set of neighbors, and such that u and v are closest according 
to a composite distance measure. This distance measure is a combination of the 
Euclidean distance between the vertices’ locations in their respective drawings 
(normalized about the centers of their bounding boxes), and the differences in 
their horizontal and vertical rankings in their respective drawings. Once this 
matching of vertices is complete, the distance between two drawings D\ and D2 
is just the sum of the pairwise composite distances between matched vertices. 

Figure n illustrates anecdotally how the inverse of our distance measure cor- 
responds to perceived similarity. The drawings in the two pop-up windows on 
the lower left are rated as very similar to each other, and the drawings in the 
pop-up windows on the lower right are also rated similar to each other. However, 
each drawing in the left pair is rated as having little similarity with each drawing 
in the right pair. Although smile’s current similarity measure correlates posi- 
tively with perceived similarity, we anticipate that deriving better measures of 
drawing (dis)similarity will be one of the main technical challenges of effective 
graph multidrawing. 



4 Future Directions 



In this section we speculate on possible further developments of the four com- 
ponents in the multidrawing architecture: layout, dispersion, presentation, and 
feedback. 



4.1 Layout 



We would like to incorporate other layout techniques into our current system 
architecture. Developing such techniques will typically require modification of 
existing algorithms to make them produce multiple different drawings of a given 
graph. Promising candidates include specialized algorithms, such as those for 
drawing trees, hierarchical graphs, and those for drawing planar graphs or sub- 
graphs. 



4.2 Dispersion 

With respect to the dispersion component, the main challenge is to fine a better 
measure of similarity of two drawings. This is crucial not only for dispersion, 
but also for organizing drawings logically in the presentation subsystem, and 
perhaps also for exploiting user feedback (e.g., by providing useful responses to 
the command: “Generate more drawings like this one.”). 
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Similarity measures may be categorized as follows^ 

— Topological similarity is the similarity of the planar graphs obtained by pla- 
narizing two drawings. Two drawings are topologically identical if they induce 
the same planar graph in the same planar embedding with the same outerface 
(0,P-32). 

— Metric similarity measures the similarity of two point sets, which for us are 
the drawn vertices p. To extend the usefulness of this technique to graphs 
may require matching labeled point sets, i.e., matching each point in one set 
with a specific point in the other set 0 . 

— Positional similarity measures the positions of nodes relative to one another, 
for example by computing for each pair of nodes whether they are in the 
same relative horizontal position in both drawings, and in the same relative 
vertical positions in both drawings. 

— Feature similarity exploits the notion of prominent features, such as a few 
faces of big area, the shape of the convex hull, or a piece that “looks like a 
tail.” 

— Operational similarity can be measured by computing the number and/or 
magnitude of operations needed to transform one drawing to another. An 
operation is applied to some piece of the drawing, e.g., reflection of the piece 
about an edge, or a change in its proportions 0 

4.3 Presentation and Feedback 

If the only goal is to achieve diversity of a set of drawings, user feedback is of lim- 
ited relevance: at most, the user might indicate a new degree or type of diversity 
for a future run. However, graph multidrawing combines diversity and aesthetics. 
The incorporation of aesthetic criteria makes it more desirable, even necessary, 
for the user to provide feedback. For example, once the user has selected some 
“nice” drawings, the system could generate a new batch with similar charac- 
teristics. To accomplish this, the system must identify which qualities the user 
“likes” in a drawing, and then map them onto the relevant system parameters 
in order to generate more drawings of the desired quality. 

5 Conclusion 

Instead of producing a single “optimal” drawing, a graph-drawing system should 
generate a diverse selection of acceptable drawings for the user’s perusal. This 
restatement of the computer’s role in the graph-drawing enterprise introduces 
many new challenges: modifying existing algorithms to generate multiple layouts; 
formalizing the notion of diversity for a set of drawings and devising heuristics 
to achieve it; and designing interfaces to support the user’s browsing task. 

^ For another taxonomy of similarity measures, see 0. 

® According to this categorization, the smile system currently uses a combination of 
metric, positional, and operational similarities. 
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Abstract. The Graph Layout Toolkit is a family of portable graph draw- 
ing and layout libraries designed for integration into graphical user inter- 
face application programs. When visualizing graphs, it becomes essential 
to communicate the meaning of each graphical feature via text labels. We 
present the interface and the basic engine of the Graph Layout Toolkit 
that produce a high quality automated placement of labels for edges of 
a graph. 



1 Introduction 

Graph layout is the automatic positioning of the nodes and edges of a graph 
in order to produce an aesthetically pleasing drawing that is easy to compre- 
hend. Graph drawings can be used to display complex information that can be 
modeled as objects and connections between the objects. In many diagramming 
applications, it is essential that a drawing be labeled. Labels assist in conveying 
information or clarifying the meaning of complex structures. The problem of 
positioning labels corresponding to graphical objects of drawings is called auto- 
matic label placement. This is very important for visualization tools in numerous 
areas such as project management, software development, database design, and 
network management. We present our approach to the above problem as imple- 
mented in the Graph Layout Toolkit. 

Many graph layout and editing systems have been developed in the past. 
Please refer to P for an overview of such systems. One essential aspect that 
has not been addressed in any previous system, is the capability to support the 
automatic placement of labels related to the edges of a drawing. 

The Graph Layout Toolkit (GLT) j5l6j is a family of graph layout libraries 
that facilitate easy integration with graphical user interface programs for the 

* Research supported in part by NIST, Advanced Technology Program grant number 
70NANB5H1162. A patent for these and related results is pending. 
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development of applications that require diagramming visual interfaces. Graph 
layout comes in different styles, each having particular features and benefits 
suited for different industries and applications. The GLT offers four different 
layout libraries: Circular, Hierarchical, Orthogonal, and Symmetric. 

Just as graph layout is a time consuming and monotonous task, so is the 
positioning of labels. The automatic placement of edge labels falls into the class 
of NP-Hard problems P| . Recent advances offer efficient solutions to the problem. 
Each library in the GLT is equipped with fast algorithms for the automatic 
placement of edge labels which are based on In this paper we present some 
of the challenges of incorporating a labeling interface into a graph layout system, 
and the way we chose to resolve these in the framework of the Graph Layout 
Toolkit. 

2 Objectives 

The labeling of edges is aimed at communicating edge attributes in the most 
convenient way. This is only possible when labels are positioned in the most 
appropriate places. 

Good label placement aids in conveying the information that labels represent 
and enhances the aesthetics of the drawing. It is difficult to quantify all the char- 
acteristics of a good label placement since they reflect human visual perception, 
intuition, tradition and/or experience. However, one can follow some basic rules: 

Elimination of ambiguity: A label which is associated with exactly one 
edge, must not overlap any other edge or any node. Otherwise it is not clear 
which object the label describes. 

Clarity: Relationships between labels and edges should be easily identified 
without cluttering the drawing. Thus, labels are positioned close to, but not 
overlapping edges if possible. 

Flexibility: Placement constraints on the labels should be allowed. For in- 
stance, in some applications it is required that a label is associated with one 
of the endpoints, or the middle, of an edge. 

It is important to emphasize here that the user must be able to customize the 
rules of label placement quality to meet specific needs. For example, the user 
must be able to specify that the preferred position for an edge label is closer to 
the source or target node of the associated edge. Building a tool that supports 
automatic labeling presents two main challenges: 

— Devise efficient algorithms that produce high quality label placement com- 
pared with manual placement. 

— Build a labeling interface that is flexible enough to accommodate the specific 
requests for good placement for a variety of applications. 

The next section details the framework and the interface built to integrate the la- 
beling algorithm into the GLT. After that, the labeling algorithms are described. 
Figured shows an example of GLT’s edge labeling facility. 



358 Ugur Dogrusoz et al. 



^ 0 / 

□ 






□ 



Ch 







□ 

□ 



□ II .D 

- □-□ 



Fig. 1. Sample drawing with edge labels produced by the GLT. 



3 Interface 

Each edge label is represented by a rectangle in the GLT. A label’s position, 
where its reference point should be placed, is determined by a percentage distance 
from the source of its owner edge and an offset from this point on the edge. These 
two values are kept constant over changes of the interactive routing of the edge 
unless the user explicitly changes either of these values (see Figure EJ. When 
a label is repositioned interactively (e.g., dragged with the mouse) these two 
values are recomputed based on the point on the owner edge that is closest to 
the label’s new position. 

The automatic label placement can be performed either during layout or 
independently, on the current drawing of a graph. In the latter case, the posi- 
tions of other graph objects are preserved while labels are repositioned by the 
algorithm. 

GLT’s tailoring options for labeling are quite flexible and allow the users to 
customize the system to their specific needs. A user can specify the preferred 
position to place a label with respect to the associated edge by specifying a 
distinct style, association, and orientation as defined below: 

Style: The style of a label specifies whether the label should be placed above 

or below, for horizontal edges, or to the left or right, for non-horizontal edges. 

Figure 0 shows examples. 
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Fig. 2. Notice how the relative positions of labels with respect to edges are 
preserved as the routing of an edge changes since the percentage distance from 
source and offset values of the labels are fixed. 



Association: The association of a label specifies whether the label should 
be placed towards the top, center, or bottom of its owner edge. In other 
words, it associates the label with the source, center, or target of the edge, 
respectively. See Figure 0 for an example. 

Orientation: The style and association of a label might have different mean- 
ings depending upon the orientation chosen. One can use either a global ori- 
entation of the drawing (based on the y-coordinates of the endpoints of the 
edge), or an edge orientation (based on the direction of each edge). 

In addition, there are a number of global tailoring options that can be used 
to fine tune the algorithm. When drawings are very dense or there is a large 
number of oversized labels, the default label assignment produced by the labeling 
system might not be satisfactory. In these instances, the user can fine tune 
the algorithm either by requiring the labeling algorithm to spend more time in 
the post-processing step as discussed later, or by relaxing the labeling quality 
constraints by allowing overlaps. For the former case, a label positioning quality 
parameter ranging from 1 to 10 sets the intensity of the algorithm, which is 
mostly related to the post-processing step. A higher integer value results in 
more accurate positioning of labels under certain circumstances but it takes 
longer to execute. For the latter case, the allowed overlap percentage parameter 
determines if the labels are allowed to overlap with one another and with other 
graph objects. When set to 10 percent, for example, each label’s dimensions are 
treated as if they were 10 percent smaller, which increases the overall success 
of the algorithm at a cost of up to 10 percent overlap. Figure 0 illustrates this 
tailoring option with an example. 



360 Ugur Dogrusoz et al. 




Fig. 3. Label style can be set such that the label is placed to the left (left) or 
right (right) of its owner edge. 

4 Algorithms 

The algorithms used in the labeling engine of the Graph Layout Toolkit are based 
on the techniques presented in |3|. First, a number of potential label solutions for 
each edge is carefully selected. Then, an assignment of labels to available label 
solutions is performed by solving a variant of the matching problem. Specifically, 
first, the label positions are grouped such that each label position, that is part of 
a group, overlaps any other label position that belongs to the same group. This 
results in mutually disjoint sets of label positions. Next, edges are matched to 
label positions by allowing at most one label position from each group to be part 
of a label assignment. Lastly, a post-processing step is performed if necessary. 
Labels are assigned to edges by locally shifting already assigned labels followed 
by a limited number of backtracking operations. 

The algorithm tries to place labels to respect the tailoring options (preferred 
position) . If it does not succeed, then it tries to find a place that is as close to the 
preferred position as possible (acceptable position). In the final label assignment 
produced by the algorithm, each label will not overlap other labels or nodes or 
edges other than its associated edge. 

The labeling techniques presented in |3], however, are not suitable for orthog- 
onal drawings because such drawings have many horizontal edges. To overcome 
this deficiency the labeling techniques have been extended to broaden the initial 
set of label positions for horizontal edges. Figure 0 shows an example. 
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Fig. 4. An illustration detailing how label association affects its placement. 




Fig. 5. A circular drawing where labels are allowed to overlap other graph objects 
to a certain extent. 
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Fig. 6. An orthogonal drawing with edge labels which contains many horizontal 
edge segments. 



The algorithms have further been extended to support placement of more 
than one label per edge. Multiple labels per edge are needed not only when 
edges are very long and repetition is necessary, but also when more than one 
attribute per edge must be displayed (see Figure 0 as an example). An iterative 
approach has been applied to solve the problem of assigning multiple labels to 
each edge of a drawing. At each iteration, one label is assigned to each edge of 
the drawing. Each successive round respects the previously placed labels and 
reduces the solution space accordingly. For more details of the algorithms, refer 
to E). 



5 Conclusion 

The Graph Layout Toolkit provides generic algorithms for automatic placement 
of edge labels. The interface to these algorithms comes with per label and per 
graph tailoring options that not only provide input to the algorithms about spe- 
cific constraints on the placement of labels, but also to adjust several parameters 
that let the user fine tune these algorithms. 

One natural extension to the Graph Layout Toolkit’s labeling support is 
integration of algorithms that can handle not only edge labels but also node 
and even graph labels. Another one of the future research goals on labeling is to 
design efficient interactive and incremental labeling algorithms for dynamically 
changing graphs. 
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Fig. 7. A U.S. map with airline routes where labels are used to convey fare and 
distance information. 
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Abstract. Techniques for drawing graphs based on force-directed place- 
ment and virtual physical models have proven surprisingly successful 
in producing good layouts of undirected graphs. Aspects of symmetry, 
structure, clustering and reasonable vertex distribution arise from initial, 
formless clouds of points. However, when nodes must be labeled and point 
vertices are replaced by non-point vertices, simple force-directed models 
produce unreadable drawings, even for a moderate number of nodes. This 
paper describes the application of two post-processing techniques that 
can be applied to any initial vertex layout to produce uncluttered layouts 
with non-point nodes. 



1 Introduction 

In general, drawing undirected graphs is problematic. The difficulty lies in there 
being too much freedom. If structure is imposed on the graph, workable tech- 
niques arise. For example, if the graph is interpreted to have a directed flow, 
one can employ the Sugiyama-style algorithms!^ E|. Alternatively, one can 
suitably restrict the layout style in order to get a handle on the problem. As 
an example here, we can consider the class of orthogonal layouts, for which a 
collection of well-developed and analyzed algorithms is available E3 El • But 

without using such restrictions, there is, at present, no simple non-heuristic al- 
gorithm for efficiently drawing general undirected graphs. 

The most effective techniques for handling undirected graphs are based on 
virtual physical models. These techniques, going back to Eades0 and, compu- 
tationally, to Kruskal(I3, represent the vertices of a graph as physical objects 
subject to various forces, natural and unnatural. Some subset of the forces en- 
codes the edge information of the graph, typically as an attractive force between 
the two endpoints of the edge. The object then becomes one of repositioning 
the nodes in order to minimize the energy of the system or to achieve a stable 
configuration vis-a-vis the forces acting on the particles. Standard techniques, 
such as steepest descent or discrete iteration, can be used to search for the de- 
sired configuration, although there is always the possibility of only finding a local 
minimum. Once the final node positions are determined, the drawing is typically 
completed by connecting edge endpoints with line segments. 

In practice, these layout methods are remarkably good, especially given the 
naive nature of the algorithms (cf. jTT^rTjV The resulting drawings typically cap- 
ture many symmetries of the graph, while avoiding the expensive computations 
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Fig. 1. The effect of non-point nodes 



involved in explicitly looking for symmetries m- In addition, these methods 
identify significant clusters of nodes while producing a reasonable distribution 
of the nodes. There has been a variety of work on this class of algorithms [Tll fTTil 
HI ini rni FI im o R] leading to some quite efficient algorithms that can handle 
“medium-sized” graphs. 

Despite the efficacy of these approaches, problems arise when they are ap- 
plied to the practical case of drawings with non-point nodes. Due to the varying 
shapes and sizes of the nodes, the resulting drawings tend to be cluttered, with 
nodes overlapping, if not totally hiding, each other, and edges visually lost in 
the confusion of being routed over or under the nodes. These problems are par- 
ticularly prevalent in the common case of graphs with little symmetry and much 
connectivity. A typical example is shown in Fig. [D Even modest sized graphs 
become unreadable. Despite this, such drawings appear in the literature. 

A simple solution is to scale the drawing up, while preserving node sizes, until 
the nodes no longer overlap (cf. Fig. EJ. Although this is simple and creates a 
similar layout, preserving symmetries, it can be wasteful of space, and one must 
still handle edge-node overlaps. We ask if there is a better way to modify an initial 
layout, repositioning the nodes to avoid overlap, while preserving proximity, 
clusters and the overall structure and layout, and using less area. Given that 
most of the graphs that arise in practice have little symmetry, we are willing to 
forego preserving symmetries. 

This paper describes the use of two general-purpose post-processing mech- 
anisms for improving these layouts to make the drawings more readable using 
modest additional area. Inspired by work of Lyons et al. I2H, the first pass re- 
moves overlapping nodes by iteratively constructing a Voronoi diagram for the 
graph and moving nodes within their Voronoi cells. The second pass, based on 
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previous edge routing workjS), draws edges as smooth curves to avoid node-edge 
overlaps. 

In the following section of this paper, we present a more detailed description 
of the post-processing heuristics that we employ. This is followed by a discus- 
sion of our current implementation of these heuristics, and sample output and 
timings. We conclude with a consideration of related work and future avenues 
for this work. 



2 Cleaning the Layout 

Given a cluttered layout, such as the one shown in Figure 0 our goal is simple: 
gradually adjust the nodes until there is adequate space between them, while 
trying to preserve their relative positions and distances, and then construct the 
edges as smooth curves avoiding the nodes. 

To achieve the first part, we use a variant of a technique developed by Lyons 
to break up clusters of nodes in a graph drawing and achieve a more uniform 
distribution. In our version, we construct a window containing the centers of all 
the vertices, and then construct the Voronoi diagram relative to this window, 
using the vertex centers as sites. The centroid of the Voronoi cell associated 
with a vertex represents the point most removed from any other vertex, and 
it is to that point we move the vertex. By iterating this process, the vertices 
move away from each other, reducing overlaps. At the same time, this motion 
largely maintains the relative node positions from the initial layout, as desired. 
Of course, if the current window is not large enough, each vertex will converge to 
the centroid of its cell and no further adjustment is possible. At times, therefore, 
it is necessary to increase the window size and allow the drawing as a whole to 
expand. Eventually, the repositionings and expansions are sufficient to remove 
all node overlaps. 

Once the nodes have been moved so that there are no overlaps, we then draw 
the edges. We do not want to use line segments since, even with the additional 
space, they will conflict with the node representations and obscure the graph 
structure. Basically, edges should avoid nodes except for endpoints. The use of 
polylines is possible, but we rejected these for aesthetic and cognitive reason^. 
We feel that some form of smooth curve works best for edges, reducing to a 
straight line where possible. 

To construct a curve representing an edge, we first find the shortest path 
connecting the midpoints of the two nodes and avoiding the interiors of any 
other nodes. This relies on Dijkstra’s algorithm and a one-time construction of 
the visibility graph of the vertices of the polygonal node^. We then attempt 
to connect the two nodes by a line segment or a Bezier curve segment. If none 
of these can avoid crossing into the interior of a node, we find the vertex on 

^ A further discussion of these issues, and our view of them, can be found in Dobkin 
et al.0. 

^ The cost of constructing the visibility graph is amortized over the construction of all 
the edges. 
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the shortest path furthest from the trial curve, break the problem into two 
subproblems and solve each recursively. Further details can be found in 0 • 

3 Implementation 

Our current prototype implementation is based on a simple pipeline of two pro- 
grams, with two elements repeated and using the dot UH format for describing 
graphs. The first component of the pipeline is neato|TH|. which takes an undi- 
rected, attributed graph and lays out the vertices as points using an implemen- 
tation of the Kamada-Kawai m algorithm, itself a close relative of Kruskal’s 
multidimensional scaling approach |1 7L Q . In addition to determining node co- 
ordinates, it also specifies the polygon associated with a nod^E 

The second component of the pipeline takes the nodes and their enclosing 
polygons, and radially expands the polygons by a given amount. This ensures 
that there is adequate space between the nodes in the final layout. In addition, it 
determines an initial enclosing window for the vertices, either from user specifi- 
cations or by expanding the enclosing isothetic rectangle by a given percentage, 
typically 5 %. It then applies the procedure, described in the previous section, 
of iteratively computing the Voronoi diagram of the vertices and moving each 
vertex to the centroid of its Voronoi cell, expanding the window when necessary, 
until none of the expanded polygons overlap. It then emits the graph with the 
new node positions. We use Fortune’s O(nlogn) algorithm to compute Voronoi 
diagrams. Polygon intersection checking uses a simplified version of the linear 
algorithm found in 0’Rourke|^, preceded by a bounding box check. 

Despite the basic simplicity of this phase, there is still room for many vari- 
ations. For example, with each iteration, one must decide which nodes to move 
and when to increase the area. Clearly, if the area is increased, the nodes on the 
perimeter must be moved in order to take advantage of the new area. Our initial 
strategy was to only move overlapping nodes as long as progress was being made, 
i.e., the number of overlaps was reduced. If progress was stalled, we increased 
the area and moved all the nodes. We found that any more subtle strategies 
along these lines (e.g., follow the same strategy as above but increase the area 
only if two consecutive iterations show no progress) only increased the number 
of iterations required and the final area. And, in fact, we found that the fewest 
iterations and the least area were produced by using the even simpler approach 
of always moving all nodes and increasing the area whenever the number of 
overlaps does not go down. 

The final component of the pipeline is another instantiation of neato. In 
this case, it accepts the given node coordinates, without repeating the Kamada- 
Kawai phase, and moves to the final layout passes of routing edges as smooth 
curves and generating device-dependent output. As with most of our tools, out- 
put formats include PostScript, GIF, and HPGL, as well as simple dot output. 
The various component algorithms used to construct edges as smooth curves are 
described in jS|, along with their respective complexities. 

^ Non-polygons, such as circles and ellipses, are approximated by many-sided poly- 
gons. 
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4 Examples 

As an example of our approach, we applied it to the graph shown in Fig. ^ and 
arrived at the drawing given in the left-hand side of Fig. El For comparison, the 
drawing shown on the right-hand side of Fig. [Dwas created by dilating Fig. dj 
preserving node size, until the nodes no longer touched. Each of these figures 
is scaled differently to fit the presentation here. If, however, they were scaled 
equally, so that corresponding nodes on the 3 graphs would have identical size, 
the drawing on the left in Fig. E| would be 1.46 times larger than the drawing 
in Fig. n while the drawing on the right would be a factor of 3.9 larger. As 
for running times, the simple neato layout took 0.18 second^ user time. Our 
technique required 3.72 seconds, with 0.18 seconds spent in the initial layout, 
0.12 seconds used to adjust the node positions, requiring 25 iterations, and the 
rest used to do the final spline routing. Again for comparison, simply scaling the 
drawing and then performing the spline routing cost 4.31 seconds. 

Table Q provides additional comparisons between the Voronoi technique and 
scaling. We applied both methods to 9 graphs that arisen in practice. Columns 
4-6 show, for the Voronoi method, the number of iterations needed to remove 
all node overlaps, the (linear) increase in the bounding box needed for the new 
layout, and the amount of time this processing took. Columns 7-8 show the 
size increase and time to remove node overlaps by scaling. We note that the 
Voronoi approach achieves significantly better use of space. It also takes appre- 
ciably longer, but this difference is negligible, as the final, edge routing pass 
dominates the layout times. The graphs dust. dot, rmf20k50.dot, dpd.dot and 
houston.jsa.dot are shown in Figs. El El and El respectively. 

Additional examples of this technique are shown in Fig. El through Fig. El 
In each case, the left layout shows the original straight-line drawing based on 
Kamada-Kawai; the right layout shows the result after applying our two post- 
processing adjustments. 



4 



All timings reported were done on an SGI Octane. 
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Table 1. Comparison of applying the Voronoi and scaling techniques for remov- 
ing node-node overlaps 



Graph 


Nodes 


Edges 


1 Voronoi 


1 Scaling 


Iterations 


Size 

increase 


Time 

(secs.) 


Size 

increase 


Time 

(secs.) 


tube. dot 


10 


18 


3 


1.1 


0.005 


1.49 


0.008 


dust. dot 


16 


73 


8 


1.61 


0.045 


2.54 


0.016 


rmf20k50.dot 


20 


56 


2 


1.0 


0.011 


1.49 


0.011 


inet.dot 


24 


51 


1 


1.0 


0.012 


1.50 


0.017 


houston.jsa.dot 


32 


98 


2 


1.0 


0.019 


1.64 


0.013 


dp d. dot 


36 


108 


25 


1.95 


0.316 


3.93 


0.031 


gte_u.dot 


49 


260 


19 


1.77 


0.336 


3.02 


0.044 


ngkl0_4.dot 


50 


100 


16 


1.20 


0.294 


3.36 


0.042 


jho5E.dot 


213 


269 


21 


1.78 


1.843 


3.24 


0.452 




Fig. 3. dust .dot 




Fig. 4. rmf20k50.dot 
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Fig. 5. houston.dot 



5 Related Work 



Concerning the problem of overlapping objects in straight-line drawings, an al- 
ternative approach is to modify the virtual physical model to prevent or at least 
minimize such overlap. For example, Kamps et al. US! introduce an additional 
energy term representing the amount of overlap of the nodes’ bounding boxes. 
Similarly, Wang and Miyamoto j2B| modify the forces between nodes, so that 
if two nodes overlap there is no attractive force between them and there is a 
stronger repulsive force. Davidson and Harel(5 make nodes and non-incident 
edges repel. 

The problem is that imposing an overlap penalty still is no guarantee that 
objects are correctly separated. This is particularly apparent in some layouts 
of large or dense graphs. More research is needed to understand possible in- 
terference between the new energy terms and the original model. Besides, the 
straight-line edge restriction can impose other problems. 

Fisheye techniques E23 can adjust spacing between layout objects, but are 
intended for interactive environments. It is not clear how they can be applied 
successfully to static layout. 

Wills [2S| proposes a post-processing phase for virtual physical models. The 
method is to compute nearest neighbors for all nodes, and increase the separation 
between node pairs that are too close. It is not clear what happens if there is 
not enough space locally. 
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As previously mentioned, our approach was inspired by Lyons et al. E3 • 
They propose improving layouts of points in the plane that typically represent 
graph nodes. The goal is to more evenly distribute the points within a fixed layout 
window, while maintaining the basic “shape” of a drawing. The method relies 
on measures of similarity and distribution. Layouts are updated iteratively by 
either a Voronoi diagram or spring model, moving points until either the drawing 
becomes too dissimilar to the input, or a desired level of distribution has been 
reached. The proposed models for similarity and distribution do not take into 
account non-point nodes or edges of the graph. 

Edge routing can be handled by any technique that satisfies certain aesthetic 
goals (e.g. make paths that are short, smooth and avoid unnecessary inflections). 
An alternative to our method described in Section 0 was proposed by Abello and 
Gansner p. 



6 Conclusions and Future Work 



The proposed approach makes readable layouts by a hybrid of virtual physical 
modeling, geometric constraints and representing edges as smooth curves. The 
approach is compatible with any initial layout. The resulting diagrams preserve 
much of the original structure, attempt to use little additional space, and can 
be constructed efficiently. 

This technique also raises some interesting problems. One area concerns the 
layout itself. The technique prevents unwanted node-node and node-edge inter- 
sections, but doesn’t address edge-edge intersections. Such intersections can be 
seen in FigureEl Global edge routing might be a good way of attacking this prob- 
lem. The idea would be to determine when edge routes might interact (perhaps 
when path edges are shared or are proximate) and then to introduce new barriers 
corresponding to implied routing constraints. Alternatively, it might be possible 
to introduce some aspects of edge routing into the force-directed model, thereby 
connecting node and edge layout more closely, as is done in most Sugiyama-style 
layouts for directed graphs. 

Another issue is that the technique of using Voronoi diagrams to break up 
clusters of nodes really represents a family of algorithms, and leaves open the 
decisions about what nodes to move, where to move them, when to increase 
the drawing’s bounding polygon, and what type of bounding polygon to use. 
We discussed some of the variations we tried in Section El Adjustment of node 
positions also potentially interacts with the force model and disturbs local sym- 
metry. More experiments are needed to understand these interactions and good 
tradeoffs. We notice that our layouts tend to disperse nodes more than necessary. 

There is a question concerning whether Voronoi diagrams with polygonal 
node shapes as sites make better layouts. If so, there is a problem of how to 
compute these diagrams incrementally as nodes move. Finally, there is a need 
for robust implementations. 
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Abstract. This paper describes DA-TU, which combines an animated 
clustering and an online force-directed animated graph drawing method 
for the visualization of huge graphs. 



1 Introduction 

Graphs which arise in Information Visualization applications are typically very 
large: thousands, or perhaps millions of nodes. Recent graph drawing competi- 
tions have shown that visualization systems for classical graphs are limited to 
(at best) a few hundred nodes. 

Attempts to overcome this problem have proceeded in two main directions: 

Clustering. Groups of related nodes are “clustered” into super-nodes. The user 
sees a “summary” of the graph: the super-nodes and super-edges between 
the super-nodes. Some clusters may be shown in more detail than others. 
An example is in Figured Note that “New South Wales” is shown in more 
detail than “Victoria” . 

Navigation. The user sees only a small subset of the nodes and edges at any 
one time, and facilities are provided to navigate through the graph. 

This paper introduces DA-TU, a system which combines both approaches. 
The following section briefly describes the model on which DA-TU is built. Some 
remarks on the implementation of DA-TU, especially with respect to the cluster- 
ing force model and the animation model, are in Sections 0 and 0 samples of 
interaction with DA-TU are described in an appendix. 

Note: the aim of this paper is to briefly describe the features of DA-TU. The 
rational behind the design will be described elsewhere. Note that DA-TU is an 
animated interactive system and it is impossible to fully describe its features on 
a static page. A video is available from the authors on request. 

2 The Framework 

The DA-TU system manipulates data in levels, as illustrated in Figure 0 We 
describe each of these levels, and the functions that operate on them. 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. .174- rTOl 1998. 
(c) Springer- Verlag Berlin Heidelberg 1998 



A Fully Animated Interactive System 375 




Fig. 1. Clustered graph. 




Fig. 2. The framework of DA-TU. 



2.1 The Graph Level 

A graph in DA-TU is a classical undirected graph, consisting of nodes and edges. 
In applications it is a very large graph, containing many thousands of nodes. 
The graph may be dynamic, that is, the node and edge set may be changing; 
these changes may be a result of user interaction through DA-TU, or they may 
be changed by an outside agent. 

2.2 The Clustering Level 

A clustered graph C = (G, T) consists of an undirected graph G = (V, E) and a 
rooted tree T such that the leaves of T are exactly the vertices of G jS] . Clustered 
graphs are closely related to compound graphs jOl ^ ; however, compound graphs 
are more general. Each node v oiT represents a cluster of vertices of G consisting 
of the leaves of the subtree rooted at i/. The tree T describes an inclusion relation 
between clusters; it is the cluster tree of G. Figure Q] shows a clustered graph. 

DA-TU can operate on a clustered graph G = (G, T) by two basic operations, 
create and destroy a cluster. Both can be performed by user interaction, or by 
an algorithm attached to DA-TU. If a vertex is added to the graph at the graph 
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level, then at the clustering level it is assigned the root of T as a parent; it may 
be moved to another cluster by the above operations. 

2.3 The Abridgement Level 

In applications, the whole clustered graph is too large to show on the screen; 
further, it is too large for the user to comprehend. DA-TU draws “abridgements” . 
An abridgement of the clustered graph in Figure Qis shown in Figure |3 We now 




Fig. 3. An abridgement. 



give a formal definition of “abridgement” . Suppose that C/ is a set of nodes of 
the cluster tree T. The subtree of T consisting of all nodes and edges on paths 
between elements of U and the root is called the ancestor tree of U . An example 
of an ancestor tree is in Figure 0 




Fig. 4. The light shaded area is the ancestor tree of the dark shaded nodes. 

A clustered graph C' = {C ,T') is an abridgement of the clustered graph 
C = (G, T) if T' is an ancestor tree of T with respect to a set U of nodes of T and 
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there is an edge between two distinct nodes u and v of G' if and only if there is an 
edge in G between a descendent of u and a descendent of v. Figure 0 shows the 
abridgement of Figure^with basis {Sydney , N ewcastle , Tasmania , Victoria}. 

DA-TU has two elementary operations on abridgements; these change the basis 
of the abridgement. They are open a cluster and close a cluster. 

2.4 The Picture Level 

Pictures of clustered graphs are shown in Figures mm and in Appendix. In this 
section we define a “picture” of a clustered graph. A picture of a clustered graph 
C = (G, T) contains a location p{v) for each vertex v of G and a route c(u, v) for 
each edge {u, v) of G, in the same way as drawings for classical graphs. Further, 
a picture has a region b{v) of the plane for each cluster v of T, such that if n is 
a leaf of T then b{y) is located at p{v)^ and if /r is a child of in T then 6(^) is 
contained in b(y). The regions currently used by DA-TU are rectangles; we plan 
to use convex polygons in the next version. 

DA-TU provides the usual operation of manually moving nodes in a picture. 
However, the main role of DA-TU is animated automatic layout. This is described 
in the next two sections. 

3 The Force Model 

In this section, we briefly outline the force model piainiizi DA-TU has three 
types of spring forces: 

— internal-spring: A spring force between a pair of vertices in the same clus- 
ter. 

— external-spring: A spring force between a pair of vertices in different clus- 
ters. 

— virtual-spring: A spring force between a vertex and a virtual (dummy) 
node along a virtual (dummy) edge. 

It is best to describe these forces with an example; see Figure The internal- 
spring forces on vertex c are along the edges (c, a) and (c, b); the external-spring 
forces on c are along the edges (c, /) and (c, g). 

Virtual-springs can be described using a virtual node in each cluster. In the 
clusters X, V, and Z, virtual nodes x', y', and z' are shown; each virtual node 
is connected to each node in its cluster by a virtual edge. Virtual-springs exert 
forces along these edges. Note that virtual nodes and edges are not shown in the 
actual picture of the clustered graph unless the user wants to see them. 

As well as spring forces, between each pair of nodes there is a gravitational 
repulsion force. 

The forces are applied additively to give an aesthetically pleasing layout of 
the graph. The sum of forces on each node is continually computed, and the 
nodes move according to the strength and direction of these forces. The details 
of the forces and their implementation is described elsewhere. The force approach 
is computationally expensive. However, at any one time, DA-TU only deals with 
a small graph, and there are no problems running on an average PC. 
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- Internal-spring Virtual-spring 

- External-spring 



Fig. 5. Spring forces. 



4 Animations 

In DA-TU, the whole visualization is fully animated. Every transition, whether 
triggered by the user, DA-TU, or by another agent, has its own specific animation. 
This greatly reduces the cognitive effort of the user in recognizing the new view 
and change; we aim for a full preservation of the user’s “mental map” 0. 

More specifically, there are eight types of animation that are implemented 
in our system. Five of these are specifically related to the clusters: animated 
gathering, the animation of cluster boundaries, the animation of scaling opera- 
tions, and animated opening and closing of clusters. Three types of animations 
are similar methods described in animated viewing, animated drawing, and 
animated addition and deletion of nodes and edges. 

5 Conclusions 

DA-TU provides methods for handling huge graphs visually. The user begins with 
a picture of an abridgement of the huge graph. At all times, a force-directed 
animated layout algorithm ensures that the picture is aesthetically pleasing, and 
that transitions between pictures do not destroy the “mental map” |3|. The 
Appendix below gives screen dumps from sessions with DA-TU. 
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Appendix: Examples 

This section contains two sequences of screen dumps from DA-TU. They illustrate 
the basic operations of the system and how it works to achieve a better quality 
of the layout of clustered graphs. The animation which is an essential feature of 
DA-TU is lost in these static pictures; a video is available from the authors. 




Fig. 6. A graph in DA-TU. 



380 



Mao Lin Huang and Peter Eades 




Fig. 7. The user creates five new clusters on the graph in Figure El However, 
this layout has five overlaps among the clusters. 




Fig. 8. The user applies the spring forces in the “gathering” operation to elim- 
inate the overlaps in Figure |3 This greatly improves the readability of the 
layout for the user. 
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Fig. 9. The same layout as shown in Figure |H| however, the virtual nodes and 
edges are shown. Note the “virtual spring” force applied between non-adjacent 
vertices c8 and cl9. 




Fig. 10. A clustered graph C with 4 levels, after the application of spring forces 
in the “gathering” mode. 
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Fig. 11. Closing one cluster, v = {61, c3, cl3, cl4}, in Figure 1101 bv clicking on 
V in the “close” mode. The representation of closed is a small rectangle. 




Fig. 12. Continuing, closing the cluster (cl, c2, c4, cl5, Cluster-!} in Fieriire rm 
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Fig. 13. Open the cluster Cluster J> in Figure 1 1 21 and then open the cluster 
Cluster A. This is done by clicking on both visual rectangles of two clusters one 
by one under in the “open” mode. 




Fig. 14. Dismissing the whole cluster tree from the layout as shown in Fig- 
ure n 31 and returning to the graph with no clusters. This could be done by 
destroying the clusters one by one; however, a “dismiss” button is provide to 
destroy all clusters at once. 
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Abstract. We demonstrate the H3Viewer graph drawing library, which 
can be run from a standalone program or in conjunction with other pro- 
grams such as SGI’s Site Manager application. Our layout and drawing 
algorithms support interactive navigation of large graphs up to 100,000 
edges. We present an adaptive drawing algorithm with a guaranteed 
frame rate. Both layout and navigation occur in 3D hyperbolic space, 
which provides a view of a large neighborhood around an easily change- 
able point of interest. We find an appropriate spanning tree to use as 
the backbone for fast layout and uncluttered drawing, and non-tree links 
can be displayed on demand. Our methods are appropriate when node 
or link annotations can guide the choice of a good parent from among all 
of the incoming links. Such annotations can be constructed using only 
a small amount of domain-specific knowledge, thus rendering tractable 
many graphs which may seem rather densely connected at first glance. 



1 Motivation and Context 

Software systems which support graph drawing occur in many varieties. Some 
systems are partially automatic but provide the user with a graphical user in- 
terface to fine-tune the resulting image 0. Others take a data file as input and 
automatically create a polished drawing as output in a batch process |5| . A com- 
mon theme that runs through most systems is a strong limit on the number of 
nodes which can be handled, either because of the presumably bounded patience 
of the human in the loop or because the totally automatic layout algorithms do 
not scale to a large number of nodes. Unfortunately, real-world graphs are of- 
ten several orders of magnitude too large for the capabilities of current software 
systems, which handle only hundreds or perhaps thousands of edges. 

In pushing beyond previous limits on graph size, we explicitly traded off 
generality for scale. Our approach fills a gap in the design space not addressed 
by previous systems. We have built a software library which supports interactive 
exploration of large graphs ranging from thousands to over 100,000 edges. Our 
library achieves this speedup by using a spanning tree as a backbone for layout 
and drawing. Tree drawing is a more tractable problem than general graph layout 
The user can draw incoming or outgoing non-tree links on demand for 
any node or subtrees. 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 384-1223 1998- 
(c) Springer- Verlag Berlin Heidelberg 1998 
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1.1 Spanning Trees 

Clearly a spanning tree approach will work well with trees or directed acyclic 
graphs. There are definitely graphs where the spanning tree backbone may re- 
sult in a misleading picture, for example bipartite or fully connected graphs. 
One might assume at first glance that our method is only appropriate for graphs 
which are very tree-like and must be inappropriate for more densely connected 
ones. However, we argue that there are in fact many “quasi-hierarchical” graphs 
for which our algorithms are well-suited. Some authors use hierarchical inter- 
changeably with directed acyclic (i.e. 0), but we have a more general class in 
mind. There are many graphs which might be quite densely connected from a 
purely graph-theoretic standpoint, but a small amount of domain-specific knowl- 
edge provides a way to make a good choice of a main parent from among all the 
incoming links to a given node. Nodes or links can be annotated with this infor- 
mation. We argue that the resulting tree-based backbone drawings enrich rather 
than mislead the user. 

1.2 Abstraction 

Some interesting attempts have been made to handle complexity through auto- 
matic abstraction |E|. The work described here is complementary to such efforts. 
Abstraction and sheer scale are both important additions to the graph drawing 
arsenal of methodology. However, the use of a spanning tree can also be thought 
of as an automatic filter which elides links in the default case. 

2 System 

A graph viewer is most helpful to a user when it is integrated with other tools. 
The H3 layout and HSViewer drawing libraries have been integrated into the 
Site Manager system from Silicon Graphics, which is aimed at webmasters and 
content creators. The screen snapshot in Figure ^ shows part of the Stanford 
Graphics Group web site on display during a Site Manager interactive session. 
The 3D hyperbolic graph view shows the hyperlink structure of the site, while 
the 2D browser view shows the directory tree structure in traditional outline 
format. Selection in one view causes highlighting and transitions in both. The 
graph structure can thus be used as an index for selecting items. The ability to 
quickly select a subtree can be useful even for users who already understand the 
structure of their graph. A more prolonged discussion about the suitability of 
our methods for various other tasks can be found in a recent paper UDI. 



2.1 Interaction 

When the user clicks on a node, it is highlighted and undergoes an animated 
transition to the center of the sphere. The importance of smooth transitions 
for maintaining a consistent mental model of an onscreen structure is now well 
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Fig. 1. Site Manager shows the hyperlink structure of the Stanford Graphics 
Group Web site drawn as a graph in 3D hyperbolic space next to the directory 
structure of the site drawn with a traditional browser. The entire site has over 
14,000 nodes and 29,000 edges, of which only some in the neighborhood of a 
course homepage are drawn in this snapshot. In addition to the main spanning 
tree, we draw the non-tree outgoing links from a subtree corresponding to the 
first day’s lecture. The tree is oriented so that ancestors of the course page appear 
on the left and its descendants grow to the right. 



known H3|. The transition includes both a translational and rotational compo- 
nent, so that when a node reaches the origin its ancestors always appear on its 
left and its descendants are to the right. This “butterfly” configuration both 
provides a canonical local orientation and also serves to minimize occlusion of 
both nodes and their text labels. 

The Site Manager system features a tight integration between the 3D hyper- 
bolic graph browser and a standard 2D file browser. When a node is selected 
in one, it is highlighted and moved to the center in both. If several nodes are 
selected at once, by rubberbanding in the outline view or selecting an entire 
subtree in the graph view, they are highlighted in both but no motion occurs. 
The support for brushing allows users to correlate information across views. 
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The same criteria used to color nodes can also be invoked as a filter to show 
or hide nodes from view. Users can also choose whether to show incoming or 
outgoing non-tree links for the selected node or set of nodes. 

2.2 Speed and Size 

The layout phase is linear in the number of spanning tree edges. On an SGI 
Onyx2 a large graph with 110,000 edges was laid out in 12 seconds. A medium 
sized graph of 31,000 edges was laid out in 4 seconds, and a small graph of 4000 
edges took a fraction of a second. The drawing time is constant - the HSViewer 
drawing algorithm always maintains the target guaranteed frame rate. A slow 
graphics system will simply show less of the context surrounding the node of 
interest during interactive manipulation. Figure El shows two views of the same 
scene - one corresponding to what the user would see during interaction, and 
one after the fringe has been filled in when the user was idle. 

The layout and drawing algorithms presented here work well up to the limits 
of main memory, but not beyond: if the entire graph does not fit into main mem- 
ory, the system is unusable. The graph with 110,000 edges could be manipulated 
interactively on an SGI with 1 GB of main memory but could not be loaded on 
a smaller 128 MB machine. 

On startup, the initial loading phase includes both file I/O and data struc- 
ture building. For the graphs mentioned above, this time was approximately 2 
minutes, 20 seconds, and 2 seconds, respectively. 

2.3 Availability 

The Site Manager application can be downloaded for free from 
http://www.sgi.com/software/sitemgr.html. It runs only on Silicon Graph- 
ics machines. Version 1.0 included only the H3 layout component, whereas ver- 
sion 1.1 also incorporates the H3 Viewer guaranteed frame rate libraries. 

The underlying H3/H3 Viewer library source will soon be released for free 
non-commercial use. Gommercial use must be licensed through Silicon Graphics, 
Inc. It will run on any machine that has OpenGL, which includes most Unix and 
Windows boxes. 

3 Hyperbolic Layout 

Our layout approach can be briefly described as a “second-generation 3D hyper- 
bolic cone tree” . The basic scheme is related to the influential cone tree method 
of recursively drawing trees in 3D space H2|. Our novel H3 layout algorithm 
distributes child nodes on the surface of a hemisphere on the mouth of the cone 
rather than around its linear circumference. Details on the two-pass layout tech- 
nique and spanning tree construction are in a recent paper jS|. 

We exploit two key properties of hyperbolic space: exponential room and an 
outsider’s view. The surface of a sphere or the circumference of a circle grows ex- 
ponentially rather than geometrically as its radius increases in hyperbolic space. 
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Fig. 2. A function call graph of a small FORTRAN benchmark with 1000 nodes 
and 4000 edges. Above a single frame has been drawn in l/20th of a second, 
while below the entire graph is drawn after the user has stopped moving the 
mouse. Non-tree links from one of the functions are drawn. 
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because of the non-Euclidean distance metric. The layout problem of an expo- 
nential explosion of tree nodes can be nicely accommodated by the exponential 
amount of space in which to place them. 

The mathematical literature documents several standard projections which 
map an infinite amount of hyperbolic space to a finite section of Euclidean 
space d We use the Klein model, where the projected area is a ball in three- 
dimensional space with straight lines preserved but distorted angles. Objects 
near the center of the ball are full size, but their projected size shrinks as they 
approach (but never reach) the “sphere at infinity” which is the surface of the 
ball. This outsider view provides a natural way to focus interest in one part of 
the graph structure and see a large amount of context surrounding it. 

These two convenient properties of hyperbolic space were also exploited in 
the two dimensional hyperbolic tree browser developed at Xerox PARC and 
productized through Inxight 0. Their system only handles trees, while ours 
supports general graphs. Moreover, our layout algorithm is quite different. Using 
a spanning tree backbone in a 2D layout algorithm would necessarily lead to 
edge crossings when drawing the non-tree edges. The 3D tree layout strategy 
can accommodate non-tree edges without actual edge crossings, although from 
any single viewpoint there will be occlusion. We rely on interactive navigation 
to help the user understand the 3D structure. 

Our first straightforward attempt at drawing cone trees in 3D hyperbolic 
space, the Geometry Center wehviz system, suffered from rather low information 
density PH. The H3 layout strikes a reasonable balance between information 
density and clutter. The traditional cone tree layout in both the Xerox PARC 
Cone Tree and the Geometry Center wehviz system places nodes on a circle - 
a ID line. In H3, nodes are placed on a hemisphere - a 2D surface. In a paper 
from Carpendale et al PJ, nodes are placed in a 3D grid - a 3D volume. In all 
three of these examples, the space in which nodes are laid out is a 3D volume. 
When the dimension of the surrounding space is the same as the dimension of 
the node structures, occlusion becomes the overriding issue. The entire focus of 
the Carpendale paper is proposing various solutions for the occlusion problem. 
We choose to lay out nodes on a surface, which is a happy medium between the 
sparseness of a line and the density of a volume. An excessively sparse layout 
like the wehviz system wastes screen real estate. If our layout was too dense, the 
leaf nodes near the surface of the ball would block our view of the rest of the 
structure, since we are outside of the ball looking in. 



4 Drawing 

Our drawing algorithm depends on the number of visible, not total, number of 
nodes and edges. The projection from hyperbolic to Euclidean space guarantees 
that nodes sufficiently far from the center will project to less than a single pixel. 
Thus the visual complexity of the scene has a guaranteed bound - only a local 
neighborhood of nodes in the graph will be visible at any given time. 
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4.1 Adaptive Drawing 

A guaranteed frame rate is extremely important for a fluid interactive user expe- 
rience. The HSViewer adaptive drawing algorithm is designed to always maintain 
a target frame rate even on low-end graphics systems. A high frame rate is main- 
tained by drawing only as much of the neighborhood around a center point as 
is possible in the allotted time. The drawing algorithm incorporates knowledge 
of both the graph structure and the current viewing position. We use the link 
structure of the spanning tree to guide additions to a pool of candidate nodes 
and the projected screen area of the nodes to choose from among the candidates. 

The link structure of the spanning tree provides us with a graph-theoretic 
measure of the distance between two nodes: the number of hops from one node 
to another is the integer number of links between them. Nodes which are a short 
number of hops from the center will usually be more visible than those which are 
a large number of hops away. If we have just drawn a node, its parents and direct 
spanning tree children, which are just one hop away, are reasonable candidates 
for drawing next. However, if we simply rely on graph structure, we may waste 
time filling in sections that are less visible while neglecting those which are more 
prominent. While the hop count between two nodes does not change during 
navigation, the projected screen area of nodes does vary. Two nodes an equal 
number of hops from the center node may have much different projected screen 
areas. 

The projected screen area of a node depends on the current position of the 
graph in hyperbolic space. Navigation occurs by moving the object in hyperbolic 
space, which is always projected into the same fixed area of Euclidean space. 
Motion on the surface of the 3-hyperboloid brings some node of the graph closest 
the pole, so its projection into the ball is both closest to the origin and largest. 
Most previous systems for adaptive drawing |S| state the viewpoint problem in 
terms of camera location, but since we have a hyperbolic viewer we need to 
consider the projection of a moving object onto a fixed camera. 

The amount of time devoted to a frame should depend on the activity of the 
user. The HSViewer library allows separate control over the drawing frame time, 
picking frame time, and idle frame time. The rendering frame time is simply 
the time budget in which to draw a single frame during user mouse movement 
or an animated transition. It is clear that the drawing time should be explicitly 
bounded instead of increasing as the node/edge count rises. The time spent cast- 
ing pick rays into the scene must be similarly bounded. Rendering and picking 
should be accomplished somewhere between five and thirty frames per second - 
our current default is 20 EPS for rendering and 10 EPS for picking. When the 
user and application are idle, the system can fill in more of the surrounding scene. 
However, the time spent on this, the idle frame time, should still be bounded 
to eventually free the CPU for other tasks. On a low-end SGI machine, a few 
seconds is often enough time to fill in most of the desired detail. Our current 
default is 2 seconds. 
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4.2 Drawing Implementation 



The heart of our drawing algorithm is a loop which draws nodes from the center 
of the sphere outwards until the time budget for a frame is exhausted. The 
ActionQueue is cleared at the beginning of a frame. It is initialized with a new 
center node, which for frame ft is the node which was closest to the origin at 
frame ft-i- It is safe to rely on frame-to-frame coherency since our algorithm 
guarantees high frame rates. At the top of the loop we pop the top node of the 
ActionQueue. The candidate entities to draw are the node itself, its incoming and 
outgoing links, the parent node, and the direct child nodes. Any of these items 
which have not yet been drawn are rendered and marked as drawn. The parent 
and child nodes are then inserted into the ActionQueue. Note that although 
only nodes which are one hop away in the spanning tree are enqueued, the links 
which are drawn may include non-tree links if the user has enabled them. As 
nodes are drawn, their projected screen area is recorded and used to sort the 
ActionQueue. At the bottom of the loop, if the elapsed time is greater than the 
allotted frame time the loop returns. Otherwise, the loop is traversed again with 
the next node on the ActionQueue. 

If the system is in idle mode, control may be returned to the drawing loop 
through the invocation of an idle callback. In that case, the old ActionQueue is 
retained rather than being cleared and initialized with a new center node. The 
drawing loop is invoked again, and is returns after another drawing frame time’s 
worth of activity. Control can be returned to the idle callback several times in 
a row, until the idle frame time is exceeded. Then the idle callback is cleared 
so that control returns to the application program until further user or program 
activity warrants new drawing activity. It is important to use the idle callback 
mechanism to trigger many invocations of the short drawing frame time loop, 
rather than indulge in drawing for the entire unbroken idle time, since the user 
may act at any time. 

All drawn nodes are kept in an additional queue, the PickQueue, which is 
also sorted by projected screen area. This queue is used by the picking routines, 
which must also have a guaranteed termination time. The pick could be occurring 
after the system has drawn several drawing frame time’s worth of nodes while 
in idle mode, so there may be more onscreen nodes than can be tested for 
intersection with the pick ray in the allotted time. The right strategy to minimize 
user frustration is to ensure that large visually distinguishable nodes can be 
successfully picked, since smaller nodes with only a few pixels of screen presence 
are less likely to be the target of user interaction. If a pick ray does collide with 
a node, that node alone is redrawn in a highlighted color to instantly provide 
visual feedback. The picking cycle of the HSViewer is fast enough to allow locate 
highlighting whenever the user simply moves the mouse over an object, and of 
course can also be used to determine whether the user has selected a node in 
response to an explicit click. 
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4.3 Attributes 

The basic geometry of nodes and links can be augmented through decorations 
such as color coding, changes of linewidth, and text labels. Such additions greatly 
increase the utility of the system. Text labels are drawn for nodes whose pro- 
jected screen areas are greater than a user-specified number of pixels. The current 
implementation draws spanning tree links with a linewidth of two and non-tree 
links with a single-pixel line. The directionality of the links is subtly color-coded, 
with the reddish end emerging from the parent and the bluish end terminating 
at the child. This color coding is non-intrusive when only the spanning tree is 
shown, but allows the user to distinguish between incoming links and outgoing 
links when non-tree link drawing is enabled. 

Many Euclidean graph drawing systems also encode information in the shape 
of the drawn node, but we have chosen to avoid this modality. The user is much 
less able to distinguish node shapes in a hyperbolic viewer than in a Euclidean 
viewer, since the number of pixels devoted to a node shrinks very rapidly as that 
node moves away from the focus point. 

In the Site Manager system the default node color coding is according to the 
MIME type of the document: HTML documents are cyan, images are purple, 
VRML is blue, and so on. When traffic statistics are inspected, the nodes are 
coded on a red to grey color gradient, where red represents the most number of 
hits and grey the least. Color coding can also be used to show dynamic data. 
A site’s traffic logs can be used to show the paths taken by Web users. A hit 
from one page to another is shown by briefly highlighting the link between them, 
which may have previously been drawn in the default link color if part of the 
spanning tree or may not have been drawn. 

5 Conclusion 

Our HSViewer library can handle graphs two orders of magnitude larger than 
previous systems by manipulating a backbone spanning tree instead of the full 
graph. We carry out both layout and drawing in 3D hyperbolic space in order 
to see a large amount of context around a focus point. Our layout is tuned for a 
good balance between information density and clutter, and our adaptive drawing 
algorithm provides a fluid interactive experience for the user by maintaining a 
guaranteed frame rate. 
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Abstract. We demonstrate the use of automatic drawing facilities in 
an interactive single-line diagram editor which is the user interface to 
a real-time power system simulator. As well as being used as a filter 
to generate ready-to-simulate schematics, the automatic drawing 
algorithm can cooperate with interactive editing actions to work 
towards obtaining a final drawing satisfying user constraints. 



1 Introduction 

We implemented in a schematic editor^ an automatic drawing facility 
based on extensions to the Sugiyama-Misue (S-M) [4] algorithm for com- 
pound digraphs applied to the drawing of single-line schematics used in 
electrical power system applications [1, 2, 3]. 

Such diagrams mainly use a visibility representation which is produced 
by our algorithm even for non-planar graphs in the presence of edge cros- 
sings and vertex clustering. 

The primary use of this facility is for generating a single-line diagram 
from existing EMTP^ files. The resulting drawing is ready to be simulated 
as all supported electrical values in the EMTP file have been translated 
into the property sheets of the generated diagram elements. 

The automatically obtained drawing has at first only horizontal or ver- 
tical edge direction. A bidirectional drawing may be obtained using either 
interactive manual editing actions combined with an automatic reconstruc- 
tion of the layout, or by incrementally rotating large or small portions of 
the drawing automatically. 

Already satisfactory portions of the drawing can also be shielded from 
further modification in a natural way. 

We will start by describing briefly the principles underlying our exten- 
ded algorithm. 

2 Quasi-Visibility Drawing of Compound Digraphs 

We call quasi-visibility the type of drawing obtained in our implemen- 
tation because our graphs are not always hierarchical planar or compound 



1. The schematic editor runs on Sun workstations under Solaris 2.5 (SunOS 5.5.1). 

2. ElectroMagnetic Transients Program. Standard batch simulation program. 
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planar [6] and the heuristic has to maintain the visibility representation as 
much as possible in the presence of bends, edge crossings and clustering. 

Our basic algorithm is limited to one orientation only for edges, increa- 
sing either in the x or -y direction. Interesting new approaches to drawing 
in both directions have been published recently [7] , but cannot be directly 
applied to our graphs because of their non-planarity and restrictions in the 
connection of edges to vertices. Our vertices must be stretched in one direc- 
tion only and edges must be connected orthogonally to the stretched side. 

3 Extensions to the S-M Algorithm 

The original S-M algorithm is divided into four steps: 



Step I 


Hierarchization: 


Step II 


Normalization: 


Step III 


Vertex ordering: 


Step IV 


Metrical layout: 



Cycle removing and compound level assigne- 
ment. 

Obtaining proper adjacency edges. 

Edge crossing reduction using the barycenter 
heuristic to reorder vertices. 

Improving vertex positioning with a variant of 
the barycenter heuristic. 



Most of our additions were concentrated in steps II and IV. 

Our extensions may be thought as adding new attributes to the layout 
produced without losing previous capabilities. These are, in increasing 
order of complexity: 



1 Multiple connection of edges with vertices. 

1 Orthogonal edge drawing. 

1 Local quasi-visibility representation. 

1 Global quasi-visibility representation. 



Multiple connection of edges (done in step IV) is easy and only implies 
using the crossing reduction obtained in step III when assigning a connec- 
tion ordering of edges to vertex. It causes a proportional stretching of the 
vertex only when a visibility representation is chosen. 

Orthogonal edge drawing is done with a variation of the method descri- 
bed in [8]. A proper algorithm for orthogonal edge drawing is necessary for 
a correct treatment of the bends introduced by the non-planarity of our 
graphs. In the planar case, only straight vertical or horizontal lines need be 
used. 

We call local quasi-visibility representation the situation where vertices 
are stretched and edges straightened only inside a given compound level 
(or cluster). In this case, edges connecting vertices in different clusters are 
joined as in the original S-M algorithm, either orthogonally or with 
straight-line paths. 




396 



Gilles Paris 



In {he global quasi-visibility representation, recursive application of the 
same algorithm to each cluster has to be forgone for a global treatment of 
edges and vertices at all compound levels, in order to stretch vertices as if 
enclosing levels of structure were transparent. 

For these representations, the definition of proper adjacency edge is 
modified (in step II) so as to reduce the number of compound hidden (or 
dummy) nodes to only these cases where vertices are ifar aparti in some 
sense. When no clustering is done, local and global quasi-visibility produce 
the same drawing. 

We can still produce with our implementation the initial layout obtai- 
ned with the Sugiyama-Misue algorithm, so that a direct comparison of the 
aesthetic choices made in each case is possible (see Fig. 6) 

The main idea behind the method used to obtain quasi-visibility is to 
allow multiple connections of a vertex to be considered as freely moving 
connection nodes and to apply (again) a barycenter method to let them 
align themselves as much as possible, even in the presence of crossings. 

The alignement is done by limiting barycentric iterations to only move- 
ments toward the right. When a crossing is encountered, connection nodes 
further to the left stop moving while those to the right of the crossing (on 
the alternate level) keep on moving to the right. The restricted movement 
produces a drawing which is not symmetrical compared to the initial S-M 
layout, but this is not perceived as a problem in our type of drawings. 

4 A Recursive Structure of Recursive Structures 

The S-M algorithm used as a basis for our method already features clu- 
stering of the graph by the use of a tree structure on top of the graph repre- 
sentation. 

We introduce another level of tree structuring in the definition of our 
bidirectional drawings. Each portion of a drawing going in the opposite 
direction (vertical or horizontal) is extracted as a full feature compound 
digraph and added as a child to the parent graph. The drawing of bended 
ibi-edgesi which are extracted edges going to or from a parent graph to a 
child graph is done by a new specialized method, while the normal exten- 
ded S-M algorithm is retained to recursively draw each child graph in its 
orthogonal direction. 

5 Constraints for Satisfactory Single-Line Diagrams 

The desired representation for complex electrical single-line diagrams 
varies with individual taste, and also often embodies electrical symmetries 
and underlying geographical positioning which are difficult to attain with 
completely automatic generation, even with a constraint grammar. Refer to 
Fig. 1 for an example of a large final drawing. 
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For this reason, our current approach has been to make both interac- 
tive and automatic actions cooperate as naturally as possible to obtain the 
final refined drawing from an automatically generated first draft. 

6 Editing with the Automatic Drawing Menu 

One approach to editing the drawing is using automatic drawing opera- 
tions on selected elements. The user makes a selection, applies the opera- 
tion and the drawing is updated. The user may undo the operation if 
needed or go on with other actions until the desired goal is attained, or 
forego the automatic drawing process and continue with manual editing 
operations. 

The automatic drawing diagram menu is shown in Fig. 2. Possible ope- 
rations, or groups of operations, which may be also invoked by a single key- 
stroke, are: 

i Fix and reposition / Update / Undo 

Automatic redrawing with portions left untouched and automatic rota- 
tion applied if needed. The previous representation may be reloaded. 

i EMTPfile 

Generate a drawing from an EMTP file selected with a file chooser. The 
filter program is automatically invoked. 

i Orthogonal branches / Oblique branches 

Use either manhattan style edges or straight lines. 

i Enclose / Separate 

Clustering or unclustering of selected elements. 
i Implode I Explode 

Collapse/expand selected elements into a small square keeping all edges 
connected. 

i EixlEree 

Freeze/unfreeze current layout of selected elements, protecting them 
from further automatic redrawing. 

i Permute 

Select a pair of elements and permute their barycentric ordering. This 
action is useful for eliminating undetected crossings, or choosing from 
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barycentric equivalent ones. The relative order of permuted elements is 
kept until the next Fix and reposition. 

i Rotate normal ! Rotate inverted ! Fuse hack 

Generate a rotated child graph keeping either the normal top->bottom 
or left->right direction of edges, or inverting it. If the parent has vertical 
edges, the child will have horizontal ones and vice-versa. A child graph 
may be fused back with its parent graph to undo this operation. 

i Show model ! Stretch bus / Vertical / Compact ! Show structure 

These are toggles for the drawing operation. The model used internally 
by the algorithm may be shown as enclosing boxes for validation. Buses 
may be stretched (visibility representation) or not (similar to the S-M 
representation). A last minute bidirectional compaction step may be 
used or not. This step violates the normal compound level hierarchy and 
some undetected collision of vertices with edges may appear. The recur- 
sive structuring of parent/child graph, clustering and fixed portions 
may be shown with differently colored enclosing rectangles. 

7 Editing with the Standard Graphic Operations 

With this alternative approach, the user concentrates only on standard 
graphic editing operations from the drawing palette, shown in Fig 3. 

The edge orientation of the topmost parent graph is important for cor- 
rect results and must be toggled appropriately prior to any editing. 

The user then applies any interactive action from the drawing palette 
to the drawing, such as rotating single elements, flipping or stretching 
them as desired, and of course any non-critical graphic action such as 
adding color or text attributes. 

If a portion of the drawing is already satisfactory, the user then selects 
and groups those elements (to any recursive level if needed) in order to 
shield it from any further modification by the automatic repositionning. 

At any time, applying the Fix and reposition action is followed by the 
following events: 

1 The list of graphic objects in the editor is scanned and candidate child 
graphs are found. Those are the connected objects having opposite edge 
orientation from the parent. 

1 Grouped objects are interpreted as fixed elements and not scanned. 

1 The relative positions of connectors between child and parent elements 
are used to infer the desired orientation of child graph edges. 

1 A unidirectional repositioning of the complete drawing is then genera- 
ted as a topmost parent graph, and automatic rotation actions are done 
to generate all child graphs. 

This method generates only 2-levels hierarchies of graphs while any 
degree of recursive embedding may be done using the diagram menu. Such 
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an existing embedding would then be reduced to two levels if the drawing 
palette technique is used afterwards, giving rise to a slightly different 
result. 

Of course, shielded fixed portions cannot always be aesthetically incor- 
porated in the complete drawing since their content is untouched by the 
positionning algorithm, but connectivity is preserved and no overlapping 
occurs. 

Further iterations of manual editing and regrouping of satisfactory 
portions of the drawing brings the user closer to the desired goal. 

Fig 4. shows part of an editing session using the diagram menu. Fig 5. 
shows a result of using only drawing palette actions. 

8 Automatic Bidirectional Drawing 

We have also recently added an automatic clustering facility along the 
lines of [5] in order to produce a bidirectional drawing by using the ratio 
cut method recursively down to a specified compound level. The revealed 
clustering is then drawn as a tree of orthogonally oriented child graphs. 

This is useful in order to produce a drawing with a more pleasing 
aspect ratio and also to emphasize structure already present in the graph 
specification. A drawback is the addition of bends to the drawing. 

9 Conclusion 

The use of the automatic drawing facility saves a lot of time at the start 
of power system studies when an EMTP file is available. 

Using the algorithm during editing may be of help to some while others 
may want to do everything manually, but its presence is transparent in the 
editor and the desired drawing may be more easily obtained with it. 

10 Future Work 

We already have the capability to implement user constraints for rela- 
tive up-down or right-left positionning for clusters or vertices. There 
remains to define a grammar for specifying such constraints to control 
automatic generation. 
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Fig. 4 - Some actions using the menu. 

a) Original graph generated from EMTP file. 

b) After extracting some child graphs. 

c) A different case, after imploding part of 
the parent graph. 

d) Changing edge orientation of parent; note 
conservation of initial orientation of c) for 
children; this action is not a direct 90° 
rotation. 




Fig. 5 - Result of automatic redrawing of manually constructed diagram. 



e) Original hand-created drawing. 

f) Automatic redrawing complemented by some permutations; the original fixed size 
border has been kept to show the small increase in area. 
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b) 



c) 



d) 



Fig. 6 - A comparison of aesthetics, 
a) Original S-M layout, 

b) with only multiple connections, c) local quasi-visibility, d) global quasi-visibility. 
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Abstract. Measuring and evaluating the runtime of parallel programs 
is a difficult task. In this paper we present tools for performance evalu- 
ation and visualization in the distributed thread system (DTS), a pro- 
gramming environment for portable parallel applications. We describe 
the visualization of a parallel trace log as an execution graph using a 
novel layout algorithm which has been tailored to expose the structure 
of multithreaded applications. 



1 Introduction 

The measurement and evaluation of parallel runtime is a problem where very 
few tools exist. We present a parallel programming environment, the distributed 
threads system DTS which consists of a parallelizing compiler, a parallel 

runtime system and evaluation tools. For evaluation of parallel applications, the 
runtime system generates a trace log of parallel execution, which is postprocessed 
to calculate the contribution of each thread to the overall runtime. Even with this 
detailed information it is often difficult if not impossible to extract important 
quantities out of the huge amount of profiling data. 

Hence, the next step was to visualize the parallel execution in a call graph. The 
goal was to be able to see several key characteristics of the execution profile in 
a glance: The overall structure of the application, its load balancing as well as 
the critical execution path. 

The problem here was to tailor the graph layout to the specific needs of call 
graph visualization. Each thread has to be clearly distinguishable from other 
threads and the execution times should be easily recognizable. Existing layout 
algorithms proved to be insufficient for this task. The main contribution of this 
paper is a novel layout algorithm suitable for the visualization of runtime graphs. 

* This research is partially supported by the DFG-Grant Ka812/4-2 “Graphenzeichnen 
und Animation” and by the DFG within SFB 382: Verfahren und Algorithmen zur 
Simulation physikalischer Prozesse auf Hochstleistungsrechnern 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 403-^^21 1998- 
(c) Springer- Verlag Berlin Heidelberg 1998 
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In Section |2| we describe the generation of execution logs and profile graphs for 
parallel applications. Section ^presents the detailed criteria for the visualization 
of call graphs, in Section 2| the algorithms und layout techniques used are dis- 
cussed. Examples of parallel algorithms and their corresponding profile graphs 
are given in Section 0 



2 Timing Parallel Applications in DTS 

Although all modern UNIX operating systems provide threads, there is no inter- 
face for measuring the execution times of single threads. Standard UNIX timing 
calls like getrusage or times provide only information about the virtual pro- 
cess time. For multithreaded processes, this is the accumulated thread execution 
time. There is no information about how many CPU time has to be accounted 
to each thread. Measuring wall clock time is completely inadequate, since the 
execution times are further disturbed by other applications running on the same 
machine. Therefore we decided to trace parallel execution based on virtual CPU 
time and calculate the runtime share of each thread using our own algorithms. 

The DTS runtime system has been modified to trace the execution of multi- 
threaded parallel programs. A DTS application usually is distributed to several 
independent hosts. Each host executes multiple threads. On uniprocessor nodes 
this allows to hide communication latencies, on multiprocessor nodes we are able 
to use all available CPUs. 

Each node produces a separate logfile, gathering timestamps for all significant 
events during execution. For each of the following seven events the virtual process 
time and additional control information is logged: 



init Start of computation on local node. 

exit End of computation on local node. 

start Creation of a new thread. The thread id is logged. 

end Termination of a thread. 

fork Start of execution of a thread on local node. 

bjoin Local node issued a join on a thread. This event marks the begin of the 
join. The caller gets suspended until the thread to be joined has terminated, 
ajoin Completion of a join. The thread issuing the join continues execution. 

Based on the information in the logfiles, the computation time for all threads 
can be computed. Each thread has to be accounted for its share of the measured 
virtual process time. This is accomplished using a simple recursive algorithm, 
which relies on some basic assumptions on the thread scheduler. Details can be 
found in |3j. 
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Host "linus" 




Fig. 1. Control flow for a single master and a single slave thread 



3 A Graph Theoretical Formulation 

An introduction based on graph grammars can be found in jSj and subsequent 
proceedings of several Workshops on Graph Grammars (LNCS 153, 291, 532, 
and 1073). We give an alternative formulation: Let G = {V,E) be the given 
graph. Ignoring the init- and the exit-node (they carry no information for the 
graph) we can partition V in flve subsets: V = VsLiVeUVf Li Vbj U Vaj, namely 
start-, end-, fork-, b-join- and a-join-nodes. 

Edges of G are either flow-edges € Ef which represent a part of a thread or 
structural edges € Eg which connect a subthread with its calling thread. Thus 
edges in Eg run from a fork-node to a start-node or from an end-node to a 
join-node and do not represent any sort of run time. Each flow-edge carries the 
information about the run time of the corresponding part of the thread. 

A thread T consists of a chain vi,...,vt of nodes, vi being a start-node, Vt 
being an end-node and V 2 , ■ ■ - Vt-i being fork- and join-nodes. Each thread is 
preceded by a fork-node and succeeded by an a-join-node. The nodes V 2 , ■ ■ ■ Vt-i 
themselves are predecessors resp. successors of subthreads of T. V 2 , ■ ■ -Vt-i can 
be paired into disjoint pairs {vi, Vj) with i < j such that Vi is the predecessor and 
Vj is the successor of the same subthread. If for any two subthreads of T defined 
by pairs {vi^,Vj,^) holds: A < *2 ^ J 2 < jii then G is a series-parallel 

graph. 

Our thread visualization tool TreVis computes drawings of runtime graphs with 
the following qualities: 



— The flve types of nodes can be easily distinguished. 

— The drawing is orthogonal where every thread is represented by a chain of 
nodes in the same column. 
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— The drawing is hierarchically (top-down) such that flow-edges are drawn ver- 
tically and structural edges are drawn horizontally (with a possible vertical 
extension if necessary because of idle times). 

— The number of edge crossings is at least locally minimal; the drawing is 
always planar for series-parallel graphs. 

— Every thread T is balanced (if possible); i.e. T will be drawn near the 
barycenter of the subgraph induced by T and its subthreads. 

— The run times are represented by the node positions. Here the y-axis is seen 
as a time axis and the y-coordinate of a node is proportional to the time 
when the corresponding action is performed. 

Many applications show graphs where some edges have a very short length com- 
pared to other edges (e.g. Fig. El 0.01 vs. 3.14). We use a special scaling strategy 
here that sets short edges to a user- (or system-) defined minimum length; this 
makes it possible to distinguish the endpoints of this edge and to recognize 
the edge itself (see the first two join-nodes of the main thread in Fig. El for an 
example). If there is an edge of length zero between a b-join-node and the corre- 
sponding a-join-node, we do not distinguish between these nodes and draw them 
as a single join-node. 



4 Algorithms 

Since the edge routing is simple for given node positions (most of the edges 
are straight, some edges have one bend) the crucial part of the algorithm is to 
compute the node positions. 



4.1 Computing y-Coordinates 

Computing the y-coordinates is easy: The init-node is getting y-coordinate 0. 
Every start-, fork-, end-, and b-join-node is getting the value of the y-coordinate 
of its only predecessor plus the length of the corresponding edge, a-join-nodes 
have two predecessors (say u and w). W.l.o.g. u is an end-node and w is a 
fork- or a b-join-node. Place v at max(y-coord(u) -I- length(u, f), y-coord(w) 
-|- length(r(;, t;)). b-join-nodes have diamond shape in our drawings and indicate 
an idle time in this part of the program. Note that solid edges in the drawings 
indicate the node positions whereas dashed edges indicate idle times. 



4.2 Computing a;- Coordinates 

We only have to compute an x-coordinate for every thread because all nodes of 
the thread will have the same coordinate. We distinguish whether the graph is 
series-parallel or not which is easy to decide (this information is often part of 
the input). 
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Series- Parallel Graphs. In series-parallel graphs a series Ti {1 < i <t) with 
Ti is subthread of l)is nested. Thus they can easily be drawn 

without edge crossings. For balancing the drawing we use the following strategy: 
We treat the subthreads of T one by one ‘outermost-first’ and store for every 
step the ranges currently used by the subthreads to the left and to the right 
of T. The next subthread to be drawn will be placed at the side of T with the 
smaller range. In the example of Fig. we first choose to place the outermost 
subthread T\ of the main thread T to the left of T. T\ uses four columns there. 
T2 (starting at the second fork-node of T) will be draw at the right side of T 
and uses two columns. Since the right range (two) is smaller than the left range 
(four) we draw T3 (starting at the third fork-node of T) at the right of T. 




Fig. 2. A series-parallel graph. 



General Planar and Nonplanar Graphs. Let T be a thread, T\ and T2 two 
subthreads of T, /i and /2 the predecessors of the start-nodes of T\ and T2 ; and 
ji and j2 the successors of the end-nodes of Ti and T2. If /i is placed above of 
(before) /2 in the time-axis, but ji below (after) j2, then T\ and T2 must cross 
if they are drawn at the same side of T. We call these threads intersecting. Thus 
we have to solve the problem of finding an assignment of the subthreads of T 
to the sides L and R (left and right) of T, such that no two subthreads on the 
same side cross or more general, we want to minimize the number of crossings. 

The intersection structure of the subthreads of T can be formalized by the in- 
tersection graph Gi = (S,I), where S is the set of nodes representing the direct 
subthreads of T and an edge e = (Ti,Tj) S I exists iff Ti,Tj are intersecting. 

A non-crossing assignment of the subtreads to L and R corresponds to a subdi- 
vision of the nodes of Gi into two independent sets. This is possible only if the 
graph is bipartite. Hence, the tests whether the thread structure can be visual- 
ized using our drawing convention can be done greedily in linear time checking 
the bipartiteness property. 
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If the intersection graph is not bipartite, we cannot avoid all crossings. Since each 
remaining edge in the subgraphs induced by L and R represents a crossings, we 
have to minimize the number of those edges, or to maximize the number of edges 
between nodes in L and R. This is exactly the well-known max-cut problem 
Note that our intersection graphs are graphs similar to interval graphs Q. The 
setting of the problem here is the same as for row routing cniiaii, although 
in row routing the goal is to minimize the number of layers, vias and/or tracks, 
while we have a crossing minimization problem. Ulrik Brandes pointed out that 
the problem is very similar to the crossing minimization in linear embedding of 
graphs which has been shown to be NP-complete |^. 

TreVis currently uses a greedy heuristic for that problem with a postprocessing 
local exchange step. That performs very well in practice (see examples in Section 
0 . For the future, we plan to incorporate exact methods from combinatorial 
optimization as well. 

5 Some Examples 

Fibonacci Numbers. In this example the Fibonacci numbers are calculated, 
using the well-known recursion formula. A slightly simplified version of the actual 
code and the resulting profile graph are shown in Fig. 0 For comparision, we 
have added a layout of the same graph by a Sugiyama style algorithm. 

RSA Encryption. RSA encryption is an example for a regular non-recursive 
parallel algorithm. The main thread forks a number of slave workers, depending 
on the size of the input file, which each encrypt a single block of the plaintext 
using the RSA method. 

Fig. a shows two call graphs of the same parallel execution using 13 threads. In 
the upper graph small edges are enlarged to emphasize the overall structure of 
the application. The calling sequence can be seen clearly. The lower graph shows 
the execution in true time scale. This view is of particular interest to evaluate 
the load balancing of the algorithm. 

Bubble Merge Sort. Bubble merge sort uses a divide-and-conquer based ap- 
proach to sort integer numbers. The divide step splits the array to be sorted and 
creates two subtasks running in parallel. Each thread uses the same algorithm 
recursively to sort its part. The conquer step grabs the pre-sorted subarrays and 
combines them using merge sort. A partial view of the call graph of Bubble 
merge sort of 100000 Integers using 1024 threads can be seen in Fig. 0 



6 Conclusion 

We presented a novel approach of visualizing the execution profiles of multi- 
threaded parallel applications. The visual inspection of the parallel call graphs 
has proved to be a very valuable tool in evaluating and tuning parallel applica- 
tions. The layout algorithm presented here improved the usability and expres- 
siveness of call graphs significantly. 
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int fibCint n) { 

dts_t dts_idl ,dts_id2 ; 
int result=0; 

/* simple case */ 
if (n<=l) return 1; 

/* recurse with two threads */ 
dts_idl=f ork_f ib(n-l) ; 
dts_id2=f ork_f ib(n-2) ; 
result+= (int) join_f ib(dts_idl) ; 
result+= (int) join_f ib(dts_id2) ; 
return result ; 



int main(int argc, char *argv[]) { 
dts_init(argc,argv,NULL) ; 
for (i=0; i< 35; i++) fib(i) ; 
dts_leave 0 ; 
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Fig. 3. Source code and call graph of Fibonacci calculation using 729 threads. 
The first drawing has been produced by TreVis, the second one is by a standard 
implementation of Sugiyama’s algorithm. 
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Fig. 4. Call graphs of RSA encryption. In Fig. A small edges are enlarged to 
emphasize the calling structure, Fig. B shows the call graph in original time 
scale. 




Fig. 5. A partial view of the call graph of bubble merge sort with 1024 threads. 
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Abstract. JIGGLE is a Java-based platform for experimenting with numerical 
optimization approaches to general graph layout. It can draw graphs with 
undirected edges, directed edges, or a mix of both. Its features include an 
implementation of the Bames-Hut tree code to quickly compute inter-node 
repulsion forces for large graphs and an optimization procedure based on the 
conjugate gradient method. JIGGLE can be accessed on the World Wide Web 
at http://www.cs.cmu.edu/~quixote . 



1. Introduction 

Most general graph drawing algorithms fall into two categories: algorithms based 
on physical models, and hierarchical algorithms that place nodes on discrete layers 
[BETT94]. 

The physically based algorithms focus on undirected graphs [Ea84, KK89, FR91, 
ELM94, SM94, Tu94, Ig95, CP96, DH96]. In most of the proposed models, straight- 
line edges attract their endpoints according to a spring law, and nodes repel each 
other as if they were like-charged particles. These algorithms use numerical 
optimization to obtain a drawing whose “energy” is locally minimal. 

In contrast, the hierarchical algorithms focus on directed graphs — generally graphs 
that are acyclic or that can be made acyclic by the reversal of a few edges [STT81, 
GKNV93]. These algorithms assign nodes to discrete layers so that most edges are 
directed from lower layers to higher ones, and then permute the nodes on each layer 
to minimize edge crossings. When edges traverse more than one layer, the algorithms 
introduce “dummy nodes” on the intermediate layers that induce bends along the 
edges. 

JIGGLE provides a framework for experimenting with numerical optimization 
approaches to drawing graphs with undirected edges, directed edges, or a mix of both. 
In addition to incorporating the key aspects of the physically based algorithms, 
JIGGLE implements three significant improvements. The first is to incorporate 
directed edges into the physical model traditionally applied to undirected graphs. The 
second is to use the Barnes-Hut tree-code to compute repulsion forces in (n log n) 
time. The third is to adapt the conjugate gradient method to perform the numerical 
optimization. 

S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 413-422, 1998. 

Springer-Verlag Berlin Heidelberg 1998 
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2. Graph Drawing as Numerical Optimization 

The physically based algorithms for undirected graphs treat graph drawing as a 
numerical optimization problem. A two-dimensional drawing of an n node graph 
corresponds to a point in and the goal is to produce a drawing that is locally 
minimal with respect an objective function that quantifies the desired aesthetic 
criteria. Given the difficulty of general optimization, these algorithms usually restrict 
themselves to objective functions that are continuous and differentiablej] This 
restriction allows them to use iterative first-order optimization techniques that depend 
solely on the gradient of the objective function. Often these algorithms do not even 
specify an objective function explicitly; rather, they specify its gradient (actually, the 
negative gradient) in terms of “force laws.” 

Usually, the laws include spring forces that attract the endpoints of edges towards 
each other and repulsion forces that push all nodes away from each other. For 
example, Fruchterman and Reingold use the following force laws: fspring(d) = d^/k; 
frepuision(d) = -k^/d. The constant k is a pre-specified optimal edge length; when an 
edge is of length k, its endpoints exert no net force on each other. 

The hierarchical algorithms for directed graphs take a markedly different 
approach. First, they assign nodes to discrete layers and create dummy nodes so that 
all edges connect nodes or dummy nodes on consecutive layers. They then permute 
the nodes on each layer to avoid edge crossings. Minimizing the number of edge 
crossings, however, is an NP-hard discrete optimization problem [GJ83]. Flence, the 
algorithms avoid edge crossings by sorting nodes and dummy nodes according to the 
mean or median positions of their neighbors on adjacent layers. The algorithms 
perform the sorting iteratively. First, they traverse the layers from first to last, sorting 
the nodes on each layer according to the positions of the nodes’ neighbors on the 
previous layer. Then, they reverse this process, sorting nodes according to the 
positions of the nodes’ neighbors on the following layer. They alternate between 
forward and reverse traversals until meeting some convergence criterion. After they 
have established the order of nodes on each layer, they optimize node positions 
subject to that order. 

We could view the hierarchical drawing approach in terms of constrained 
numerical optimization. Our objective function might be the sum of the squares of 
horizontal components of edges, so that a particular node’s contribution to the 
objective function would be minimal when its x coordinate is the average of the x 
coordinates of its neighbors. We would need two sets of constraints: the y 
coordinates would be fixed according to the layer assignments, and the nodes on each 
layer would have to be separated from each other by some minimum distance. 



* Most of the objective functions have singularities when two nodes occupy the same 
coordinates, but they handle these singularities by randomly separating the coincident nodes. 
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Unfortunately, every possible ordering of nodes on the layers would lead to a 
drawing that is a constrained local minimum. The constraints separate all of the local 
minima from each other, necessitating an optimization procedure that can jump over 
the constraint barriers. 

Instead of attempting such an approach, we use an unconstrained approach, 
replacing the minimum separation constraints with repulsion forces. 



3. Our Physically Based Model for Mixed Graphs 

Since the spring and charge model works well for undirected graphs, we looked for 
a simple modification that would take into account directed edges. The spring force 
for an undirected edge tries to minimize the distance between the endpoints. Inspired 
by the use of layers in the hierarchical approach, we made the following change in the 
spring law: when a spring is associated with a directed edge, it tries to place the “to” 
endpoint a distance proportional to k below the “from” endpoint. 

This way of including directed edges in the model works surprisingly well. It does 
not, however, behave reasonably if the graph has a directed cycle. To handle this 
situation, we borrow from the hierarchical algorithms: we preprocess the graph using 
depth-first traversal to detect directed cycles and break the cycles by reversing the 
back edges of the traversal. Having solved this problem, we discovered that it was 
necessary to introduce a short-range repulsion force between nodes and edges to 
prevent nodes from being placed on top of edges. We use an inverse-square law for 
node-edge repulsion. 

The JIGGLE interface allows the user to choose among a variety of force laws. 
The spring force can be logarithmic, linear, or quadratic. The user can set the short- 
range (d 2k) and long-range (d > 2k) repulsion forces independently as inverse- 
linear or inverse square laws. The user can also turn off long-range repulsion forces, 
as Fruchterman and Reingold do in their “grid variant” approach. The node-edge 
repulsion force is also an option that the user can toggle. 



4. Computing the Forces Efficiently 

Our straightforward computation of the spring forces requires (m) time. Beating 
this time would require some way of filtering or summarizing edges — a strategy we 
do not consider here. 

We focus instead on computing the repulsion forces efficiently. Since there are 
*/2n(n-1) node pairs, a straightforward computation of node-node repulsion forces 
would require (n^) time. If the graph is sufficiently dense (i.e. m is (n^)), there is 
little point in speeding up this computation, since the spring computation will take 
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(n^) time. Often, however, we are interested in large, sparse graphs. For these 
graphs, we can benefit significantly by reducing the computational cost of repulsion 
forces from quadratic to linear or near-linear time. 

We have implemented Fruchterman and Reingold’s “grid variant’’ approach. This 
heuristic ignores repulsion forces between nodes that are more than 2k apart in the 
drawing. If the drawing distributes the nodes sufficiently uniformly, then the 
computation of repulsion forces requires only (n) time. Unfortunately, the abrupt 
cut-off of repulsion forces can significantly distort the drawing and cause the 
optimization procedure to oscillate around the cut-off boundaries. Also, if the node 
distribution is highly non-uniform, then the computational cost may be as high as 
(n^). 

In order to reduce the computational cost for repulsion forces without significant 
loss of accuracy, we have implemented the Barnes-Hut tree-code [BH86]. Every time 
we compute the repulsion forces, we insert the nodes into a quad-tree. The root cell 
of the quad-tree represents the bounding rectangle of the drawing space. When we 
insert the first node, we split this root into four children cells which represent the four 
quadrants of the rectangle. Subsequent insertions lead us to split these cells and their 
descendants so that each leaf cell contains at most one node. We then traverse the 
nodes to compute the repulsion force acting on each node. First, we look at the 
node’s siblings in the tree, then at its parent’s siblings, then the grandparent’s 
siblings, and so forth. We compute the repulsion force between a node and a cell as 
follows: if the cell is itself a leaf, then we use the regular repulsion law for two nodes. 
If the cell is not a leaf, then we determine if the node’s rectangle and the cell’s 
rectangle intersect. If the rectangles share even a single point, we recursively 
compute the repulsion between the node and the cell’s children. Otherwise, we 
compute the repulsion force as if all of the cell’s nodes were concentrated at their 
centroid. 

The running time for Barnes-Hut depends on the height of the tree and the number 
of cells than a leaf cell can intersect. Assuming that nodes are never assigned to 
identical coordinates, we can bound the height of the tree by the number of bits used 
to represent a point in the drawing space, which we assume to be O (logn). In order 
to bound the number of cells that a leaf cell can intersect by a constant, we can 
rebalance the quad-tree by splitting cells after each insertion [Mo95]. If we perform 
this rebalancing, then the overall running time for Barnes-Hut is (n log n). In 
practice, the overhead of rebalancing the tree does not seem to justify the theoretical 
performance guarantee. We therefore use Barnes-Hut unbalanced quad-trees, relying 
on its observed (n logn) behavior. 

Computing node-edge repulsion naively would take (nm) time — an even more 
expensive operation than that of computing node-node repulsion! Here, however, we 
can take advantage of node-edge repulsion’s being a short-term force, and use a grid- 
variant technique to address it. If the average edge length is (k) and the node 
distribution is sufficiently uniform, then we can compute node-edge repulsion forces 
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in (m) time. In order to avoid the cut-off problem, we subtract a constant from the 
magnitude of the force, thereby making it continuous. We could instead use a 
variation of Barnes-Hut, but we have not implemented such an approach. 

In summary, we compute the gradient in (m + niogn) time. For very large sparse 
graphs, we might consider using the Fast Multipole Method, which computes 
repulsion forces in (n) time [GR87]. Its overhead, however, is too large to make it 
practical for graphs of less than lO"* nodes. 



5. The Optimization Procedure 

Quantifying the aesthetics as force laws and computing the forces efficiently is 
only half of the problem. We also need a procedure that computes a drawing that is 
locally optimal with respect to those force laws. 

As we noted in the introduction, the force laws actually determine the negative 
gradient of an implicit objective function. Knowing the gradient allows us to use 
iterative first-order optimization strategies. Most of the published algorithms use the 
method of steepest descent or some variation thereof. Steepest descent always 
chooses the negative gradient as a search direction, and then uses some sort of line 
search procedure to determine how far to move along that direction. If the objective 
function has a lower bound and the line search satisfies a few reasonable assumptions, 
then the method of steepest descent will always converge to a local minimum. Its 
convergence rate, however, is linear, and can be very poor in practice. In order to 
obtain better performance, we have adapted the conjugate gradient method, which has 
superlinear convergence in theory and outperforms steepest descent in practice. We 
refer the reader to a discussion and analysis of these and other first-order optimization 
techniques in any standard optimization text, such as Gill, Murray, and Wright 
[GMW81]. 

The conjugate gradient method is an iterative method that finds the unique global 
minimum of a quadratic function when its Hessian matrix (which is constant) is 
positive definite. Each iteration of the conjugate gradient method performs a line 
search; if this line search is exact, then the method will solve the minimization 
problem exactly in n iterations, where n is the number of variables in the 
minimization problem. The first iteration searches along the negative gradient; 
subsequent iterations use a linear combination of the previous search direction and the 
current gradient. 

Our problem, unfortunately, is somewhat messy. The objective function is 
certainly not quadratic, nor is its Hessian matrix positive definite. Not only does it 
not have a unique global minimum, but it will often have many local minima. 
Moreover, finite-precision arithmetic rules out an exact line search in principle, and 
computational expense limits the accuracy of the search in practice. Nonetheless, we 
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can use the conjugate gradient method on a non-quadratic objective function with an 
inexact line search, as long as we restart it from scratch whenever the current search 
direction is no longer a descent direction. 

If a function is quadratic and its Hessian is positive definite, then we can make a 
theoretical comparison of the convergence behaviors of the steepest descent and 
conjugate gradient methods. In both cases, the rate of convergence depends on the 
spectral condition number of the Hessian — that is, the ratio between the largest and 
smallest eigenvalues. If we use steepest descent, then, as we approach the solution, 
each iteration only reduces the distance to it by a factor of (( - 1 ) / ( h- 1 ))^ For the 
conjugate gradient method, this factor becomes (( - 1) / ( An elegant 

analysis of the convergence behavior of each method appears in Shewchuk’s 
introduction to the conjugate gradient method [Sh94]. 

Since the function is not quadratic and our line search is inexact, we cannot make 
any guarantees about the rate of convergence. Nonetheless, our empirical results lead 
us to conjecture that our variation of the conjugate gradient method still provides an 
asymptotic improvement on the number of iterations necessary for convergence. 

Having a method to compute the search direction, we still need to determine the 
size of the step we take in that direction. The cheapest method — a constant step 
size — has the problem that too large a step size can lead to non-convergent 
oscillation, while too small a step size results in a very large number of iterations 
before convergence. The other extreme is to use a very accurate line search, e.g. one 
based on polynomial interpolation. Such an approach, however, has the drawback 
that each line search may require many gradient evaluations. Our approach uses an 
adaptive step size. We us an empirically determined initial step size, and we increase 
or decrease this step size on each iteration based on the previous one. We perform a 
gradient evaluation to ensure that we do not overshoot; if our step size is acceptable, 
we use this computed gradient for the following iteration. 



6. Empirical Results 

In order to measure the effects of introducing Barnes-Hut and the conjugate 
gradient method, we have compiled empirical results for some undirected graphs. 
Our test suite of square meshes, complete binary trees, and hypercubes consists of 
graphs that we could easily parameterize by size and for which we could confirm the 
quality of the drawings by eye. While we cannot claim that our observations 
generalize to all graphs or even to all undirected graphs, we are at least able to 
venture a few conjectures. 

The table below shows the empirical results we obtained using a PC with a 133 
MHz Pentium processor running Microsoft’s TVM under Windows 95. We drew 
each graph under four conditions: with and without Barnes-Hut, and with either 
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conjugate gradients or steepest descent. We held all other parameters constant: k = 
100, quadratic spring force, inverse-linear short and long range node-node repulsion, 
and no node-edge repulsion. We produce the initial drawing by scattering the nodes 
randomly on an 800 by 600 rectangle. For each graph and setting, we ran seven tests 
and used the median for each measured quantity: the number of iterations, the number 
of gradient evaluations, and the total time necessary to converge to an optimal 
drawing. 

We used a conservative convergence criterion: the average of the square of the 
distances moved by the nodes on an iteration must be less than 0.01. Often we could 
have stopped the algorithm much sooner, but we did not want to introduce a 
subjective element by using an “eyeball” convergence norm. 



Several entries in the table are marked with asterisks (*); for these, the time 
necessary to converge was over half an hour. 
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The data in the table allow us to make several observations: 

1) Denser graphs require fewer iterations than sparser ones. Intuitively, we can see 
that for denser graphs, where the gradient is dominated by the spring terms, the 
objective function behaves much more linearly than it does for sparser ones, 
where the repulsion terms play a more significant role. 

2) The average number of gradient evaluations per iteration is less than 2, 
suggesting that the adaptive step-sizing works well for both the conjugate 
gradient and steepest descent procedures. 
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3) Barnes-Hut is a major improvement over the naive computation of repulsion 
forces for large graphs. It is slightly more expensive for small ones; the break- 
even point seems to be between 100 and 200 nodes, depending on the topology 
of the graph. The inaccuracies of Barnes-Hut seem to require slightly more 
iterations for convergence, but the decreased cost of an iteration more than makes 
up for the increased number of them. 

4) For the meshes and trees, the conjugate gradient method requires far fewer 
iterations than steepest descent. We conjecture that, for most graphs, the number 
of iterations for the conjugate gradient method is sublinear in the number of 
nodes, while the number for steepest descent is linear. 

5) For the hypercubes, there seems to be no clear winner among the two methods. 
In fact, the number of iterations does not seems to grow with the number of 
nodes. It is not clear whether this rapid convergence is the result of their 
logarithmic density or their high degree of symmetry. 



7. Examples 

The following are examples of drawings produced by JIGGLE using the default 
settings. For each drawing, we indicate the number of iterations and the amount of 
time used to produced it. 




63-Node Directed Binary Tree 255-Node Undirected Binary Tree 

300 iterations, 5 secs 900 iterations, 65 secs 





32 X 32 Square Mesh 
300 iterations, 135 secs 



6-Dimensional Hypercube 
80 iterations, < 1 sec 



/It / i ^ 

^]/V \rv'' 

16 -Node Binary Tree/Star 
100 iterations, < 1 sec 



16-Node Binary Tree with Level Cycles 
100 iterations, < 1 sec 



Complete Graph of 12 Nodes 
50 iterations, < 1 sec 



8 by 8 Torus 
100 iterations, < 1 sec 
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8. Future Work 

There are several directions in which we hope to extend the present work. The 
first is to generalize the force laws, taking into account nodes with width and height. 
The second is to make the objective function time-dependent. As we have seen, some 
objective functions are ideal for the early iterations, while other are more suitable for 
fine-tuning. We hope to develop a formal approach to “scheduling” the gradient as a 
function of the number of iterations. The third is to improve over-all efficiency. We 
believe that our optimization procedures can be better tuned, and we are also 
developing an implementation of the JIGGLE algorithms in C-H-. 
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Abstract. This report describes the Fifth Annual Graph Drawing Con- 
test, held in conjunction with the 1998 Graph Drawing Symposium in 
Montreal, Canada. The purpose of the contest is to monitor and chal- 
lenge the current state of the art in graph-drawing technology P)|ni^[Z|. 



1 Introduction 

Text descriptions of the four categories for the 1998 contest are available via 
the World Wide Web (WWW) jSj- Approximately 17 submissions were received, 
including two videos and two live demonstrations. The winners for Categories A- 
C were selected by a committee of experts (whose names are listed in the ac- 
knowledgements). The winner for Category D was selected by vote of all the 
symposium attendees. Conflicts of interest were avoided on an honor basis. The 
winning entries are described below. 



2 Winning Submissions 

2.1 Category A 

The theme for Category A was incremental/dynamic graph drawing. The data 
consisted of addition and deletion operations that specify how a graph depicting 
a fragment of the WWW changes over time. 

The judging committee did not award a first place in this category. However, 
two submissions were awarded “Honorable Mention” and split the prize fund. 
Although the contest rules did not specify a submission format for this category, 
the two top submissions were both recorded animations. 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 423-|52HI 1998. 
(c) Springer- Verlag Berlin Heidelberg 1998 
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Figure Q] shows a single frame from the submission of U. Brandes (Ulrik.- 
Brandes@uni-konstanz.de), V. Kaab, A. Loh, D. Wagner, and T. Willhalm, 
University of Konstanz, Germany. They use an energy-based layout algorithm 
that produces a straight-line 3D drawing. It favors downward-pointing edges 
and penalizes excessive movement between consecutive layouts. In addition, the 
viewpoint is updated automatically to keep the whole graph in view, to minimize 
the change in viewer position, and to avoid occlusions. The actual Web pages 
are texture-mapped onto the sides of the cubic nodes, and can be read when the 
viewer zooms in. 




Fig. 1. Honorable mention. Category A (original in color). 



The final frame from the submission of Thomas Wurst, (wurst@informatik.- 
uni-tuebingen.de). University of Tubingen, Germany, is shown in Figure El The 
Inca incremental graph-drawing algorithm was developed within the GraVis 
system (Oj. It is similar to the well-known Sugiyama layout algorithm, except 
that the assignment of nodes to layers is not done to minimize edge crossings, 
but instead takes into account the relative positions of the nodes already there 
in an attempt to preserve the user’s “mental map” of the drawing. Note also 
the use of diagonally oriented text labels, a simple technique originally used by 
Schreiber and Friedrich 0. 
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Fig. 2. Honorable mention, Category A (original in color). 



2.2 Category B 

The graph for this category was provided by Siemens AG, and is typical of graphs 
that arise in the context of computer-integrated manufacturing. The vertices 
represent various states of a manufacturing machine, and the relations between 
them represent the possible state transitions. Each state transition is modeled as 
a path of length two with a vertex (smaller than the main vertices) in between. 
There are also text labels or tags assigned to some of the vertices (main or 
subvertices) 0 The original hand drawing looked very confusing; approximately 
two weeks of laborious manual editing were needed to refine it. The refined hand 
drawing is shown in Figure El 

The winning drawing is shown in Figure 0 It was produced by Vladimir 
Batagelj and Andrej Mrvar ([Vladimir. Batagelj, Andrej.Mrvar]@uni-lj.si) from 
the University of Ljubljana, Slovenia, using the “Pajek” system m- Because 
symmetries in the graph (with some exceptions) were very noticeable, they de- 
cided to obtain an initial layout using a Fruchterman-Reingold spring embedder. 
Then they used manual editing (which took three hours) to maximize symme- 
tries and made some adjustments to take into account the different sizes and 
shapes of nodes. 



^ Meaningless labels were substituted for the actual text to preserve confidentiality. 
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Fig. 3. Manual drawing of subject graph, Category B. 
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Fig. 4. First prize, Category B. 



The second-place drawing for Category B is shown in FigureO It was submit- 
ted by Michael Wissen (wissen@mpi-sb.mpg.de) from the Max-Planck-Institute 
for Computer Science, Saarbriicken, Germany. He used a new graph-drawing al- 
gorithm that is particularly suited to small graphs. The vertex-placement step of 
the algorithm is based on so-called region trees and aims at placing the vertices 
so that they split some region (see m)- The labelling step uses a simple greedy 
algorithm. The drawing was generated fully automatically. 



2.3 Category C 

The only information given out for Category C was that it involved an elegant 
graph of theoretical significance. In fact, the graph has girth II (i.e., no cycles of 
length less than 11), degree 3, and 112 vertices. A graph with a minimum number 
of vertices for a given degree and given girth is a cage. It has been shown recently 
that there are no graphs of girth 11 and degree 3 with fewer vertices. A manual 
drawing [n@ is shown in Figure |H| 

Several researchers reported that they found it very difficult to make sense 
of this graph using standard graph-layout algorithms. The top two drawings 
both took novel approaches to the analysis and subsequent layout of the graph. 
First prize was awarded to Petrus Abri Santoso (Peasant@acm.org) and Audi 






. 5. Second p 
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Fig. 6. Manual drawing of subject graph, Category C. 



Surjanto (Asurjant@gamma.binus.ac.id) from the Bina Nusantara University, 
Jakarta, Indonesia, for the drawing in Figure 0 They began by dividing the 
graph nodes according to their eccentricity. Nodes of common eccentricity were 
positioned on their own circle. The arrangement and coloring of nodes in each 
circle were then refined to emphasize the symmetry in the graph. 

Second prize was awarded to Egon Pasztor (pasztor@merl.com), from MERL- 
A Mitsubishi Electric Research Laboratory, Cambridge, Massachusetts, U.S.A., 
and Doris Tsao (dtsao@fas.harvard.edu), from Harvard University, Cambridge, 
Massachusetts, U.S.A. They began with an analysis of the sums of successive 
powers of the graph’s adjacency matrix, which revealed eight exceptional pairs 
of nodes that were separated by a longer shortest path than all others. By manu- 
ally separating these nodes and gathering their neighbors, the graph was seen to 
be two sets of eight, trilevel binary trees. The leaves of these trees are connected 
by 64 edges, which represent an awkward permutation. Drawing the graph well 
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Fig. 7. First prize, Category C (original in color). 



therefore seemed to reduce to the problem of drawing the permutation in an 
orderly way. This was accomplished using a custom program that allows the 
user to interactively reorganize the crossing of a permutation. A more complete 
description of the permutation-crossing program and this unusual drawing is 
available on the WWW . 

2.4 Category D 

The only requirement for submissions in this category was that they be some 
form of artistic expression inspired by or related to graph drawing. 

The clear winner in this category was submitted by Roland Wiese (wiese@- 
informatik.uni-tuebingen.de) from the University of Tubingen, Germany. The 
drawing in Figure Elcontains four copies of K^q. The nodes are arranged so that 
they form a big blue circle in the center of the drawing. An orthogonal drawing 



Graph-Drawing Contest Report 431 




Fig. 8. Second prize, Category C (original in color). 



algorithm was used to compute the layout of the K^q graph. It was implemented 
in the GraVis system |^. 

Three other submissions were awarded joint second prize. The ASCII draw- 
ings in Figure E3 were created by Therese Biedl (therese@cs.mcgill.ca), who 
devised a family of ASCII-character patterns for lines of different slopes, using 
only symbols for which the symmetric symbol (with respect to the y-axis) also 
exists. 

The two other submissions to share second prize were both videos. Figure 1171 
shows 10 keyframes taken from an animation in which multiple drawings of Ala, 5 
blend one into the next. The set of drawings was generated automatically to bal- 
ance diversity and aesthetics Pj. This was joint work by: Therese Biedl, McGill 
University, Canada; Joe Marks, MERL-A Mitsubishi Electric Research Labo- 
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ratory, U.S.A. (marks@merl.com); Kathy Ryall, University of Virginia, U.S.A.; 
and Sue Whitesides, McGill University, Canada. 




Fig. 9. First prize. Category D (original in color). 



Figure ^3 shows a still from a video depicting a WWW site using an inter- 
active 3D graph-drawing system called IN3DNET. It was submitted by Marco 
Sbarrini (sbarrini@datamat.it) and Valerio Violi from the Third University of 
Rome, Italy. 



3 Observations and Conclusions 

The exceptionally high quality and originality of this year’s contest submissions 
demonstrates how far the field has progressed in the last few years. 
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Fig. 10. Joint second prize, Category D. 



Incremental layout still remains a challenge for the Graph Drawing commu- 
nity, and no doubt will continue to be included in future contests. However, the 
submissions for Category A showed that work on incremental layout and dia- 
gram animation is providing approaches that are at least partially successful at 
both preserving a user’s mental map and at manifesting graph-structure changes. 
Video submissions clearly had an advantage in demonstrating incremental-layout 
capabilities. The continued encouragement of video submissions should also help 
to stimulate applications of 3D layout. 

The entries for Categories B and C suggest worthwhile directions for future 
graph-drawing research (such as identifying and exploiting internal graph struc- 
ture, handling variable-sized nodes well, and supporting text annotation). As 
in past years, most of the winners combined automated and manual techniques 
to great effect. It is worth noting that the two winners in Category C both 
used graph-analysis techniques (i.e., measures of eccentricity, etc.) that are not 
computed by many commercial and academic graph-drawing systems. 

Category D, the “artistic” category, was received with much enthusiasm, and 
will be continued next year. 

The variety of approaches represented by the contest winners, and the enthu- 
siasm for the competition is encouraging, and we want to take this opportunity 
to thank all the entrants again. 

4 Acknowledgements 
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Fig. 11. Joint second prize, Category D. 




Fig. 12. Joint second prize, Category D (original in color). 



Robin Chen, Henry Kulzer, Brendan McKay, and Gordon Royle contributed 
graph data for the contest. Peter Eades, Michael Jiinger, Joe Marks, Petra 
Mutzel, and Stephen North served on the judging committee. Jan Kratochvil 
acted as arbiter and auditor. 
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Delaunay |5| is a visualization system to display and query object-oriented 
databases. It supports DOODLE fP, a visual and declarative meta-language, 
with which users can specify the display of quantitative information (such as 
bar charts and pie charts) and of qualitative information (such as graphs) . The 
visual vocabulary available to the user comprises only simple visual primitives 
such as box, circle, and line. It also supports constraints as a means of specifying 
distances between objects (length constraints) and occlusion between objects 
(overlap constraints). 

Work in graph drawing has traditionally focused on algorithmic approaches, 
which are designed to solve efficiently a limited class of graphs using a particular 
drawing. Less common approaches that allow the user to specify what the graph 
will look like, instead of how that drawing is going to be achieved are called 
declarative. Typically these approaches use constraints and are computationally 
inefficient 

We concentrate on the capability of DOODLE for specifying the drawing of 
graphs declaratively using the Delaunay system that we have been implementing. 
Besides addressing efficiency concerns, the solver should also deal with constraint 
cycles satisfactorily. For example, the SkyBlue 0 constraint solver actually fails 
to solve relatively simple constraints if they contain cycles, which are difficult to 
avoid in graph drawing problems. 

We have therefore designed and implemented our own constraint solver, based 
on the algorithm presented in 0. In 0, we showed that using DOODLE, we 
can specify a variety of drawings for a class of graphs that includes trees, series- 
parallel graphs, and acyclic digraphs. Furthermore, we have shown that such 
drawings can be achieved in optimal time. 

The role of the constraint solver is to calculate the absolute locations and 
dimensions of each visual object. The coordinates of these objects are represented 
by landmarks, the horizontal or vertical distance between any two landmarks 
being represented by a length constraint. The constraint solving strategy is as 

* Research supported in part by the National Science Foundation under CAREER 
Award IRI-9896052 and CISE Research Instrumentation Grant 9729878. 



S.H. Whitesides (Ed.): GD’98, LNCS 1547, pp. 436-|52] 1998. 
(c) Springer- Verlag Berlin Heidelberg 1998 



Implementation of an Efficient Constraint Solver 437 



follows. We build a constraint graph where each vertex represents a variable 
associated with a landmark, and each edge represents a dependency between 
two variables established by a constraint. The edges are directed for a max or 
min constraint, and are undirected otherwise for all other length constraints. 

First, we need the absolute horizontal and vertical coordinates of an origin 
landmark. From these coordinates, the solver attempts to calculate the positions 
of other landmarks that share a length constraint with the origin. In subsequent 
iterations, the values of the coordinates of the newly positioned landmarks are 
propagated, until all variables are instantiated. We have successfully solved con- 
straints where cycles are present and efficiency is guaranteed by avoiding dupli- 
cation of computations. Like the rest of the Delaunay prototype, the constraint 
solver is implemented using Java. 

We will be presenting details of the implementation, DOODLE programs 
that implement a variety of drawings (e.g., planar upward, containment, and 
H-V drawings of trees, Z\-drawings of series-parallel digraphs), and drawings as 
obtained by Delaunay. 
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This work studies the structure of origami bases via graph drawings of 
origami polyhedra. In particular, we propose a new class of polyhedra, called 
extreme-base polyhedra, that capture the essence of “extreme” origami bases. 
We develop a linear-time algorithm to find the “natural” straight-line planar 
drawing of these polyhedra. This algorithm demonstrates a recursive structure 
in the polyhedra that was not apparent before, and leads to interesting fractals. 

Our hope is to discover fundamental properties of paper folding through 
graph drawing. One important problem in origami mathematics is to find a use- 
ful characterization of when a crease pattern can be folded flat (using every 
crease). This problem is known to be NP-hard 0. However, this does not pre- 
clude the possibility of a simple characterization, only the tractability of such 
a characterization. For example, one conjecture that we have arrived at from 
this work is that every flat-foldable crease pattern has a Hamiltonian cycle. This 
property is insufficient, but a similar condition that is NP-hard to verify yet easy 
to understand may fill the gap in understanding flat origami. 

Origami bases are the starting point for designing origami models. An ex- 
treme origami base maps the boundary of a polygonal piece of paper (convex 
for our purposes) to a common plane. Two examples of extreme bases are given 
in Fig. n^c). The crease pattern of an extreme base is the union of the medial 
axis of the polygon and “perpendicular folds.” One interesting property of these 
extreme bases is that they can be folded while keeping the faces rigid. 



Fig. 1. (a-c) Folding of an extreme base for a triangle (top row) and quadrilateral 
(bottom row), (d) Polyhedra formed from partial foldings; the lower convex hull 
is drawn below. 

* Supported by NSERC. 
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In order to understand how the boundary of an extreme base becomes copla- 
nar, we add the lower convex hull to a partially folded extreme base, thus forming 
an extreme-base polyhedron; see Fig. ind). It can be shown that the topology of 
this polyhedron is independent of the stage of the partial folding. This work 
examines planar drawings of these polyhedra. We consider this infinite class of 
polyhedra to be part of a more general class of origami polyhedra, which come 
from partial foldings of other origami bases. 

A simple way to draw extreme-base polyhedra is to take the crease pattern 
and add curved edges representing the lower convex hull; see Fig. |2| (left). Basi- 
cally, there is a cycle of curved edges connecting the ends of the perpendicular 
folds emanating from a common vertex of the medial axis. 




Fig. 2. Left: Curved drawing of the polyhedron from Fig. 0 Middle and right: 
“Stretching” into a drawing with straight lines. 

One can imagine “stretching” the outside cycle to straighten out the curves, 
and recursing until we have a straight-line planar drawing; see Fig. El In the full 
version of this paper fP, we describe a linear-time algorithm for computing such 
a drawing directly from the topology of the medial axis, producing what we call 
the natural drawing; see Fig. El 




Fig. 3. The natural drawings of extreme-base polyhedra for hexagonal pieces 
of paper. 



One advantage of the natural drawing is that it displays a recursive structure 
in the polyhedra. By lazily generating an infinite tree (in principle representing 
the medial axis of a polygon with an infinite number of vertices) the algorithm 
produces interesting fractals; examine Fig. El for an idea. To help view these 
fractals, as well as complex finite drawings, we introduce a zoom feature that is 
similar in spirit to fisheye views El but retains the geometry in order to display 
the recursive structure. 
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Abstract 

Combinatorial graphs are increasingly used for information presentation. They 
provide high information density and intuitive display of multiple relationships, 
while offering low cost because they can be created algorithmically. Essential 
to algorithmic graph layout is a set of rules that encode layout objectives. How 
these rules are related to inferences drawn from the graph by human observers 
is a largely unexplored issue. Thus, success or failure by algorithmic standards 
is only uncertainly related to perceptual effectiveness of the resulting layout. 
Human experimentation is the only way to correct this deficiency. 

This poster describes empirical research conducted in 1994. Forty-six respon- 
dents, separated into naive and computer-aware groups, freely viewed a collection 
of graph layouts, providing semantic conclusions they reached on the basis of the 
layout, in the absence of any semantic attribution to nodes in themselves. We 
were interested in two questions. First, are semantic attributions consistent or 
random? If the former semantic objectives must be considered when creating 
layout rules or objective functions for automated graph layout. Second, if con- 
sistent semantic attributions exist, what are they? The remaining paragraphs of 
this abstract describe our results and conclusions. 

Most importantly, all our observers agreed strongly as to the semantic con- 
tent of specific graph layouts. There was no difference in interpretation between 
the group consisting of experienced programmers, and the group who had little 
exposure to computers. We were interested in possible differences because combi- 
natorial graphs are extremely common in computer science curriculum material, 
and it’s possible that a group of programmers might agree because they had 
been exposed to a common set of layout conventions. The agreement between 
our two groups demonstrates that semantic conventions extend widely. 

The most important semantic attribution is that observers view graph lay- 
outs hierarchically. Graphs are separated into interconnected subgraphs. Both 
the relative layout of the subgraphs and the layout within each subgraph are 
interpreted semantically. Thus, it is possible to encapsulate layout semantics in 
terms of relationships in graph layouts that have relatively few nodes. A few 
rules that we observed to be used by more than 90 

1. Nodes positioned symmetrically, whether in a circle, on a line or on a grid, 
are interpreted as having properties in common, or as being equal in status. 
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2. Nodes positioned centrally, and nodes positioned nearer the top of the 
layout are interpreted as having special properties, or as being higher in status. 

3. When nodes are arranged linearly, it is assumed that the sequence is sig- 
nificant, with the sequence going from left-to-right or from top-to-bottom. 

4. Drawn edges are used in determining interpretations of patterns. For ex- 
ample, the left figure is interpreted as a special node that is superior to two 
similar nodes, while the right figure is interpreted as three similar nodes. 



5. The absence of any of the above features is interpreted semantically as the 
absence of the interpreted quality. For example, the figure below shows three lay- 
outs of a K8 graph with one edge missing. The right layout maximizes symmetry 
and minimizes edge crossings (traditional aesthetic layout principles), however, 
it is difficult to discern immediately that an edge is missing. The middle layout 
breaks symmetry to allow quick identification of an anomalous state. The right 
layout breaks the symmetry of the edges in a manner that allows finding the 
missing edge easily, although more edge crossings are introduced. 






Another example shows how different layouts affect semantic interpretation of 
the same combinatorial graph. The left layout below is interpreted as a hierarchy 
of two elements controlling three others, whereas the right layout is interpreted 
as five equivalent elements. The absence and presence of different features creates 
different semantic meanings. 







We don’t claim that this list is comprehensive. The relationship of features 
like the ones we discovered to principles of perceptual organization, like those 
discovered by Gestalt psychologists, is clearly required to put our results onto 
a more solid quantitative footing. What is, however, abundantly clear from our 






Human Perception of Laid-Out Graphs 443 



experiments is that perceptually effective graph layout requires layout rules and 
objective functions that are far more complex than those in current use. The 
algorithmic consequences of the necessarily broadening of layout objectives re- 
mains to be explored. 

This project was the thesis research of the first author. The thesis, which 
provides a comprehensive description of the research, is available by e-mailing 
him at 

EADenglerOcgl . UWaterloo . ca. 
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The hugeness of the Web and its continuous growth have made navigation in 
the Internet extremely difficult. The new advanced features provided by HTML 
extensions and scripting languages allow a common browser to manage powerful 
hypermedial representation in each single page but leave unsolved some struc- 
tural problems of the Web. In fact, the process of finding information by surfing 
the Web is mainly hindered by the lack of a reasonable schema in the hyper- 
space; broken and redundant links make the problem even worse. This leads the 
user to become ’Tost in the hyperspace” (LH-Syndrome). 

Ptolomaeus is a system that helps the users to deal with the complexity of the 
Web, by generating and visualizing Web maps. It is written in Java and consists 
of a robot, which explores the Web with a user defined configuration, and of a 
graph drawing algorithm which draws the map and allows customizations. 

Ptolomaeus maps give a pictorial representation of the Web, using a graph 
in which each node is a Web page and each edge a link between two pages. We 
think that a clear representation of the structure of Web sites greatly enhances 
the exploring capabilities of a common browser, weakening the LH-Syndrome. 

Several other tools have been recently developed “around” the idea of Web 
maps. A comparison among the most popular of such tools can be found in the 
enclosed tables. We have considered features concerning the visualization, explo- 
ration, and browsing. The meaning of the fields is the following. Tool: commonly 
adopted name of the tool. Company: university, research institution, or private 
company that develops the tool. Layout: type of drawing convention adopted 
for the maps. Type of Graph: type of graphs that can be displayed. Several 
tools require the graph to be a tree and this is a severe limitation on the usability 
for understanding the portion of the Web that is under exploration. Visualiza- 
tion: specific visualization features. Browser: possible links with some browser. 
SW Requirements: minimal software suite required for using the tool. Robot: 
main features of the robot. Notes: other remarks. 

The approach that have been used in Ptolomaeus in order to find the best 
suitable layout algorithm follows two main ideas. A hyerarchical representation: 
users explore the Web from a starting page toward other related pages perceiving 
a hyerarchical structure. Graph structure of the Web: information is both in the 
pages and in the links between them. 



* Research supported in part by the ESPRIT LTR Project no. 20244 - ALCOM-IT 
and by the CNR Project “Geometria Computazionale Robusta con Applicazioni alia 
Grafica ed al CAD.” 
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According to these two principles we have chosen the drawing technique pro- 
posed by Sugiyama, Tagawa, Toda. In Ptolomaeus this technique is implemented 
with some customization. Particular emphasis has been given to the readability 
of the map, by using visualization features such as icons and colors to repre- 
sent different kinds of Web objects and by using clear labels written inside each 
node. Further, Ptolomaeus provides several layout facilities which allow to ev- 
idence relations between pages, to contract not interesting nodes, to zoom the 
map. 

Other facilities are present in Ptolomaeus concerning exploration and brows- 
ing. A user can setup Ptolomaeus to work with his/her preferred browser and 
can navigate in an unusual interesting way by alternating automatic exploration 
and direct browsing. A map can be seen as a complex bookmark which repre- 
sents the whole structure of a site instead of a single page. Web designer can 
exploit Ptolomaeus as a valuable help in the design and maintenance activities. 
In, fact Ptolomaeus evidences broken links, structural anomalies, and provides, 
in the ’’Applet version”, the users with an on line map usable in the exploration 
of the Web site. 

Further info at www.dia.uniroma3.it/~vernacot/ptolpage.htm. 



Tool 


Company 


Layout 


Type of Graph 


Visualization 


Browser 


astra 


mercury 


radial 


spanning tree 


animation, small labels 


any 


site server 


microsoft 


radial on a sphere 


spanning tree 


rotation, small labels 


explorer 


hy+ 
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1 Introduction 

Tom Sawyer Software produces commercial-quality layout and diagramming 
component technology for use by corporate enterprises, software providers, and 
educational institutions. Software application developers utilize Tom Sawyer 
Software’s technology to solve difficult modeling, complexity management, and 
diagram visualization problems. Our focus is the continued research into layout 
theory and how we can apply our technical expertise in solving our customers’ 
problems. To date we have two comprehensive product families, the Graph Lay- 
out Toolkit (GLT) and the Graph Editor Toolkit (GET). Designed as software 
components to be embedded within customer applications and equipped with 
a well-documented API, they are available for a wide range of programming 
environments. 

2 Products 

2.1 Graph Layout Toolkit 

The Graph Layout Toolkit j2| is a family of portable object positioning libraries 
designed for integration into Graphical User Interfaces (GUI). Gurrently avail- 
able in more than 50 configurations, it creates more understandable graph draw- 
ings by choosing geometric positions for nodes and a suitable routing for edges. 
The GLT can be used with almost any GUI toolkit, windowing system, compiler, 
operating system, or database through G, G-l— k, Java and ActiveX interfaces. 

2.2 Graph Editor Toolkit 

The Graph Editor Toolkit P provides a software framework that eases the ad- 
dition of advanced graph visualization technology to GUI products. The Graph 
Editor Toolkit is built upon and linked to two framework systems, the Graph 
Layout Toolkit and the Microsoft Foundation Glass (MFG) Library. It is cur- 
rently available on Microsoft Windows platforms as DLLs and ActiveX controls, 
and a port to Unix is underway. Furthermore, we are working on a 100% Pure 
Java implementation of the GET. 

* This work was supported in part by NIST, Advanced Technology Program grant 
number 70NANB5H1162. 
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3 Commercial Applications 

Graphs are commonly used in communicating relational information that arises 
in commercial applications ranging from database design to network topology 
diagrams. For example, many CASE tools use graphs to model the dependen- 
cies between modules in a large program. Further examples of diagrams which 
illustrate relational data are Data Flow Diagrams, PERT charts, various UML 
diagram types, and models abstracting complex physical networks such as large 
corporate communication networks. 

4 Software Architecture 

Tabled shows the 4-tier architecture of Tom Sawyer Software’s technology. The 
diagramming layer supports four primary layout styles with specific features 
available within each library. Other diagramming features include ports in hier- 
archical and orthogonal layout, incremental layout, and nesting functions. The 
abstraction layer provides folding, hiding, and collapse/expand functionality to 
simplify drawings. For applications requiring a GUI layer, the GET provides 



Foundation 


Functionality 


Presentation 

Abstraction 

Diagramming 

Model 


Zoom, scroll, menus, 
icons, inspectors 
Navigation, folding, 
hiding, nesting 
Sizes, routing, labels, 
layout, pasting 
Graph, nodes, edges 


user database 



Fig. 1. Technology overview. 




Fig. 2. Examples of presentation level 
functionality: property inspector and 
overview windows. 



components to create a presentation layer to access the underlying model, dia- 
gramming and abstraction layers (cf. Fig. EJ, as well as overview windows, prop- 
erty inspectors and high-level event handling. 
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Abstract. A wordgraph is a multidimensional outline capable of representing 
arbitrary finite directed graphs. For example the following pictorial directed graph: 




Can be represented as a wordgraph with n2 as the start node as follows: 
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The example contains 9 directed edges and 7 nodes. The start node is placed at 
the left hand edge of the document as would the root of an outline, all edges incident 
into to that node are placed above it, indented and represented uniquely with a less 
than symbol, Any edges incident out of a node are placed below and indented 
one level as in outlining. If a node has been previously presented then a caret is 
placed before the node to show that it is expanded elsewhere. In terms of a spanning 
tree these edges represented by their nodes would be cross edges. 

Outlining is increasingly adapted as a navigation, organizational and 
visualization tool for computer applications and is a very powerful metaphor for 
representing hierarchical or tree structures. Prior to Wordgraph, arbitrary complex 
systems could not be represented in an outline form. 

Implemented as a computer control or component, common techniques used 
with outline controls are employed; expansion and contraction with plus and minus 
objects, focusing on a particular node or edge and the use of icons to represent the 
entities represented by the nodes. Wordgraph provides great utility for applications 
where navigation, visualization and organization of complex systems is required. 
Changing the start node is accomplished by clicking on any node. A path from one 
node to any other node on the path can be displayed. Any directed graph can be 
represented as a wordgraph. The following is a contracted view representing the 
example from above: 
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j File Edit View Window 
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Wordgraph is patent protected technology which simplifies graph drawing by 
reducing the reliance on pictorial representations. 
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The use of multimedia tools in education has gained a lot of interest during 
the last decade (see, e.g., P). Free standing multimedia as well as tutorials dis- 
tributed via the Internet provide the potential for students to learn on their own, 
at their own pace, and in their own sequence, whereas textbooks or instructors 
usually impose a sequence how to learn the material. Besides general reasons 
for using multimedia tools in education the computer provides an excellent op- 
portunity to explain and visualize complex subjects with an abstract theoretical 
background. 

Currently the authors develop an Internet tutorial called VisA (Visualized 
Automata and Formal Languages) for the WWW where hypertext together with 
visualized and animated standard constructions (used in proofs) provide an easy 
and flexible access to automata theory and formal languages [ 2 |. The major part 
of the Internet tutorial is a software package, also called VisA, for displaying 
automata and standard constructions thereon in the WWW. VisA the software 
package is written in the object oriented programming language JAVA"'"'^ and 
can also be used as a stand-alone tool to teach basics of automata theory. Due to 
the wide range of the material of a complete undergraduate course in automata 
theory and formal languages, we have restricted the first implementation phase 
to finite automata only. 

Already existing stand-alone tools such as, e.g., AMoRE, FLAP, and 
GRAiL, only show and explain automata and their functioning, whereas VisA 
is also able to visualize, animate, and illustrate mathematical constructions in a 
step-hy-step manner. The idea to animate constructions (algorithmical behavior) 
is not new and already used in, e.g., the theory of data-structures and (compu- 
tational) geometry. However, to our knowledge it has not yet been used in the 
above mentioned context. In the so far implemented prototype the following 
visualizations as JAVA"”"^ applets are available (for the constructions see, e.g., 
tracing a deterministic or nondeterministic finite automaton on a given 
input string, (2) step-by-step conversion of a nondeterministic finite automaton 
into an equivalent deterministic one (powerset construction), (3) step-by-step 
conversion of a finite automaton into an equivalent regular expression, and (4) 
step-by-step conversion of a regular expression into an equivalent finite automa- 
ton. A screenshot showing one step in the animation of (2) is given in Figure Q 
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Fig. 1. One step in the animation of the powerset construction for state {goj 93 } 
applied to the finite automaton depicted on the left-hand side. The right-hand 
side shows the parts of the deterministic automaton constructed so far. The 
development is illustrated with the help of color illumination. 



Obviously, the visualization of constructions as shown in the above figure 
depends on good graph drawing algorithms. Since we are interested in step-by- 
step simulations of algorithms as well as interactive manipulations of the input 
objects (the automata) we need algorithms for incremental graph drawing. Since 
we have not found an appropriate incremental algorithm for our purpose, this 
might be a good starting point for further research. 

During autumn 1998 we expect to complete a prototype of the software pack- 
age and the Internet tutorial in order to use it in the undergraduate course 
“Einfiihrung in die Informatik III,” which will be held at the university of 
Tubingen in winter semester 1998/99. In parallel, empirical studies and statistics 
that the package achieves its goal in real classroom applications will be done. 
Under URL http : //www-f s . inf ormatik. uni-tuebingen. de/VisA/ further in- 
formation on Vis A will be available, soon. 
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An important and challenging task in cartography is the labeling of maps — 
attaching text to geographic features. Usually, the features to be labeled are 
regions, lines, and points on a map. One of the traditional formulations of the 
problem of labeling points is the point-feature-label placement problem, where 
we are given a set of points in the plane, and an axis-parallel rectangular label 
associated with each point, and the problem is to place each label with one corner 
at its associated point such that no two labels overlap. |K ImSiSIlKWa.hH iMShfll j 
IKba92j This problem is known to be NP-complete. There are approximation 
algorithms when the problem consists of scaling the labels. 

In this paper we propose an alternative where each label is a rectangle with 
fixed area, but varying height and width. The elastic labeling problem is to choose 
the height and width of each label, and the corner of the label to place at the 
associated point, so that no two labels overlap. 

The elastic labeling problem is useful when the goal of placing a label at a 
given point is to associate with the point some text consisting of more than one 
word. In this case we can write the specified text inside the label using one, two, 
or more rows, as long as the label is placed at the specified point. 



This is 




a Label 

41 





This is a Label 



Fig. 1. An Elastic Label. 



Our main result is a polynomial time algorithm for the special case of the elas- 
tic labeling problem when the given points lie on the boundary of our rectangular 
map. This “rectangle perimeter labeling problem” arises when the perimeter of 
the map is labeled with information about objects that lie beyond the boundary 
of the map, e.g. where the roads lead to, etc. This problem is likely to be relevant 
in GIS as maps are displayed dynamically on a computer screen using clipping, 
panning, and zooming. 

To solve the rectangle perimeter labeling problem, we first tackle two sub- 
problems where the points lie on only two sides of the rectangle, either two 
adjacent sides or two opposite sides. We call the case where the points lie on two 

* Research partially supported by NSERC 
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Fig. 2. A decomposition. 



adjacent sides of the rectangle the two-axis labeling problem, and the case where 
the points lie on two opposite sides of the rectangle the two-parallel lines labeling 
problem. Our polynomial time algorithm for the two-axis labeling problem was 
presented in l!lmbVhl . 

Here, we present a polynomial time algorithm for the two-parallel lines label- 
ing problem, and we show how to solve the rectangle perimeter labeling problem 
in polynomial time using algorithms for the two subproblems. 

For the two-parallel lines labeling problem we use a greedy approach, adding 
elastic rectangles from bottom to top, keeping the most recently added rectangles 
elastic, and fixing the earlier ones. To solve the rectangle perimeter labeling 
problem, we show that for any instance of the problem it suffices to consider 
a polynomial number of decompositions of the rectangle into regions in which 
labels from only two sides of the rectangle compete for space. (It is intuitively 
clear that near a corner of the map, for example, only the labels from two sides 
of the map are relevant.) Thus we can apply our algorithms for the special cases 
in these regions. The dashed lines in Figure 0 illustrate one such decomposition. 
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1 Introduction 

VGJ is an automated system capable of converting a textual description or a 
drawing of a graph into a well organized and readable layout of the graph. VGJ 
(visualizing graphs with Java), includes a graph editor and a set of algorithms 
that will automatically layout and draw a graph. The graph drawing package can 
be accessed through the web at: www.tufts.edu/ mccreary/graph_drawing.html. 

2 Graph Editing Capabilities 

The user interface of VGJ is very intuitive and contains both the typical features 
of tools that support drawing graphs as well as some additional features.. With 
a click of the mouse, a user can create nodes and edges, select single or multiple 
nodes and edges, and move selected objects. Initially the graph is directed and 
the edges are drawn with an arrow to indicate direction. The user can set the 
graph type to undirected by selecting an option in the properties menu. A dialog 
box allows the user to specify a node’s (i) shape (oval or rectangle), (ii) label, (iii) 
label position, (iv) exact node position and (v) node dimensions. The label can 
be placed in one of three ways: below the node, centered in the node or autosized. 
If the node is autosized, the label is placed inside the node and the node is sized 
to encompass the label. Associated with selected nodes are handles that allow 
then to be dragged, scaled proportionally or scaled in either dimension. Nodes 
can be combined and displayed as a single node through the ’’group control” 
menu. Groups can be created in stages and separated in reverse stages. 

A dialogue box with edge attributes is associated with each edge. The at- 
tributes include edge label, line style, bend points and data. Edge labels are 
drawn parallel to the edge so that edge identification is clear. Bends can be 
inserted by specifying the x- and y-coordinates of each of the bend points. 

Gorresponding to every visual graph is a textual representation in GML 
(Graph Modeling Language) [1]. The user can also convert the VGJ drawn graph 
to PostScript format. 

3 Layout Algorithms 

Gurrently VGJ offers three layout algorithms: Tree, to layout rooted trees; GGD, 
for directed graphs; and spring, for undirected graphs. There is also an algorithm 
to test a graph for biconnectivity or make it biconnected by adding edges. 
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Tree and Undirected Graph Layout: The tree algorithm implementation 

is that of Walker [4]. Trees are drawn so that nodes at same level lie on a 
straight line; parents are centered over their children; there is vertical symmetry; 
isomorphic subtrees are drawn identically. 

The undirected graph algorithm is that of Kamada and Kawai[2]. Their al- 
gorithm defines ‘energy’ between pairs of graph points and works to minimize 
the total energy of the graph. 



Directed Graph Layout: CGD, clan-based graph drawing, produces a layout 
for directed graphs [3]. The goals of the layout are to (1) follow the direction of 
the arcs so that ancestor nodes always lie above their descendants; (2) balance 
the nodes horizontally within each level; (3) have few edge crossings; (4) have few 
edge bends. The node layout is determined by the combination of (1) parsing of 
the graph into logically cohesive subgraphs and (2) defining layout attributes to 
apply to the resulting parse tree. The parse is based on a simple graph grammar, 
and the attributes that are now programmed into CGD produce a layout whose 
nodes are balanced both vertically and horizontally. Its parser is the first we 
know about that decomposes directed graphs into a tree of subgraphs (clans) . 

CGD defines an attribute grammar for the parse tree, and computes node lay- 
out through the attributes. Graph parsing in an improvement over the hierarchi- 
cal approach because it discovers clans, structures which have two-dimensional 
affinity rather than layers which have only one-dimensional similarity. The use of 
graph parsing distinguishes CGD from all other general directed graph drawing 
schemes. 

CGD’s drawings are unique in several ways: (1) The node layout is balanced 
both vertically and horizontally. (2) Nodes within a clan, a subgraph of nodes 
that have a common relationship with the rest of the nodes in the graph, are 
placed close to each other in the drawing. (3) Nodes are grouped according to 
a two-dimensional affinity rather than a single dimension such as level. (4) The 
layout can easily display nodes of varying sizes because space is reserved for each 
node in the bounding box attribute of the parse tree. 
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PLGOH±THMS FOR GRRPH DPRWZNG 



This poster presents AGD, a library of Algorithms for Graph Drawing. The library of- 
fers a broad range of existing algorithms for two-dimensional graph drawing and tools 
for implementing new algorithms. The algorithms include planar graph drawing meth- 
ods such as straight-line, polyline, orthogonal, visibility, and tree drawing methods. In 
order to make these algorithms useful for general graphs, we provide various planariza- 
tion methods ranging from heuristic to optimal algorithms. Data structures, like, e.g., 
PQ-trees, have been especially tailored for applications in graph drawing. Users can 
engineer their own hybrid methods by combining the provided tools like planarization, 
2-layer crossing minimization, and various shelling orders (see FigureQ]). 

Most graph drawing algorithms place a set of restrictions on the input graph like 
planarity or biconnectivity. We provide a mechanism for declaring such a precondition 
for a particular algorithm and checking it for potential input graphs. A drawing model 
can be characterized by a set of properties of the drawing. We call these properties 
the postcondition of the algorithm. There is support for maintaining and retrieving the 
postcondition of an algorithm. 

AGD is written in the programming language C++ and uses the LEDA platform 
for combinatorial and geometric computing. The design of the library is based on the 
object oriented features of C++. Graph drawing algorithms as well as combinatorial al- 
gorithms are modeled as classes. The implementations of exact optimization algorithms 
for NP-hard problems use the branch-and-cut system ABACUS. The algorithms are im- 
plemented independently of a visualization or graphics system by using a generic lay- 
out interface. Layout interfaces are currently available for the LEDA data type Graph- 
Win and for the graph editor Graphlet. The open design makes AGD very easy to use 
and to extend. AGD is publically available via http://www.mpi-sb.mpg.de/AGD/. 

* This project was partially supported by DFG-grants Ju204/7-3, Mul 129/3-1, Na 303/1-3, 
Forschungsschwerpunkt “Effiziente Algorithmen fur diskrete Probleme und ihre Anwendun- 
gen” 
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A paper on the design of AGD has appeared in: G.F. Italiano, S. Orlando (eds.), Proc. 
of the Workshop on Algorithm Engineering (WAE ’97), Venice, (http://www.dsi. 
unive . it/~wae97/proceedings/). 




Fig. 1 . Overview of the AGD library; grey labels indicate new or improved algorithms 
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1 Introduction 

Let U be a set of distinct points in some metric space. For each point a; G U, let 
Tx be the distance from x to its nearest neighbour, and let Sx be the open ball 
centered at x with radius equal to the distance from x to its nearest neighbour. 
We refer to these balls as the spheres of influence of the set V. The open sphere 
of influence graph on V is defined as the graph where {x, y) is an edge if and 
only if Sx and Sy intersect. 

The maximum size of sphere of influence graphs (SIGs) in Euclidean space 
has been studied. Most notably, Avis and Horton proved that the number of 
edges of a SIG in the Euclidean plane is linear with respect to the number 
of vertices P, and Guibas, Pach and Sharir extended the result to any fixed 
dimension |2| . However, little attention has been placed on SIGs in other metric 
spaces. In this abstract, we focus on the size of open SIGs in d-dimensional metric 
spaces induced by the oo-order Minkowski metric, where the spheres of influence 
are open hypercubes. We refer to these metric spaces as and present upper 
and lower bounds for the maximum number of edges in open L^-SIGs. 

2 An Upper Bound on the Number of Edges 

Lemma 1. Let S be a collection of open balls in such that no ball in S 
contains the center of any other ball in S . Then for all points p G p is 
contained in no more than 2‘^ balls of S. 

With the aid of LemmaH we would like to make our arguments by examining 
intersections which include the corners of the spheres of influence. However, since 
the spheres of influence are open and thus do not contain their corners, we instead 
choose to examine points which are inside the spheres and “close enough” to the 
corners. We refer to these points as e-corners and define them as follows. 

Let S be the spheres of influence of some point set, and let P be the set of all 
poly topes determined by pairwise intersections of balls in S. Let e be one-half 
the smallest width over all polytopes in P. For each corner c of a ball in S, we 
define an e-corner as c -I- evc^o where Vc^o is the unit vector in the direction 
from the corner to the center of the ball. 
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Lemma 2. Let X and Y be two intersecting spheres of influence in Then 
the number of e-corners of X plus the number of e-corners of Y contained in 
X C\Y is at least two. 

If we have n points, then there are 2‘^n e-corners. Each of these is involved in 
no more than 2^^ — 1 intersections, and by Lemma Q each intersection contains 
at least two e-corners. This leads us to Theorem [0 

Theorem 1. No open sphere of influence graph ofn vertices in has — 

edges or more. 

3 A Lower Bound on the Maximum Number of Edges 

We construct a lattice in space such that each vertex has — 6d — 8) /9 

sphere of influence neighbours. 

Let Uj be the unit vector parallel to the dimensional axis. We generate a 
lattice with the basis B = {hi ■. 1 < i < d] such that 

hi — Ui fl- Si ^ ^ Uj Si ^ ^ Uj 
j<i j>i 

where each Si is a small positive real number, such that 0 < < 1/4, and 

0 < for all 2 < j < d. Let this d-dimensional lattice be known as T^. 

Theorem 2. In the infinite lattice Td, each vertex has (2^^+^ — 6d — 8)/9 sphere 
of influence neighbours. 

Thus for finite SIGs of n vertices which are subsets of Td, we achieve an 
asymptotic bound of ((2^^^+^ — 3d — 4)/9)n edges. 
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Introduction 

Triangulation of a set of points is a fundamental structure in computational ge- 
ometry. According to the authors’ best knowledge, there is not much research 
done on maximum weight triangulation, MaxWT. From the theoretical view- 
point, MaxWT and its counterpart, the minimum weight triangulation, attract 
equally interest. The graph drawings as MaxWT are investigated. 

• The weight of a triangulation T{P) is given by 

uj{T{P)) = uj{prpj), 

pTpJ<^T{P) 

where uj{pipj) is the Euclidean length of line segment PiPj- 

• A maximum weight triangulation of P (MaxWT (P)) is defined as for all 
possible T{P), uj{MaxWT{P)) = max{uj{T{P))}. 

Main Results 

(A) O(n^) time algorithm for MaxWT of an inscribed n-gon. 

(B) 0(n) time algorithm for MaxWT of a regular n-gon. 

(C) O(n^) time 0.5-approximation algorithm for MaxWT of a general convex 
n-gon. 

(D) Linear-time algorithm for maximum drawing of Caterpillar graphs. 

(E) Forbidden (non-maximum weight drawable) graphs on any convex point 
set. 

(A) Inscribed Polygon Case 

FACT 1: If P is a convex polygon, then each interior angle of any fly triangle 
of the MaxWT (P) must be no less than 

FACT 2: If P is a convex polygon, then no interior angle of any fly triangle 
of MaxWT (P) is larger than 

* This work is supported by NSERC grant OPG0041629 and RGC grant HKU 
541/96E. 
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FACT 3: If P is an inscribed polygon. Then MaxWT{P) cannot contain 
any fly triangle. That is, the internal edges of MaxWT{P) form a tree. 

Recurrence Formula 

• Wij The weight of convex subpolygon of the inscribed polygon P with 

vertics + 1, j) for t, je[0, n]. 

/ 0 if J — (^ “t” ^^modulo.n] 

^ max{Wij-i,Wi+ij} + ui(ij) otherwise 

• O(n^) time algorithm. 

(B) Regular Polygon Case 

• Any inner-spanning tree of a regular n-gon P is maximum and it together 
with the boundary edges of P form a MaxWT{P). 

• An inner-spanning tree of a regular n-gon P can be found in linear time. 

(C) A 0.5- Approximation Algorithm for MaxWT(P) 

• Let Aabc be a fly triangle of MaxWT{P). The removal of a fly triangle 
Aabc will divide P into three components, each associates with an edge of Aabc. 
Aabc is called an ear-fly triangle if at most one of its three components contains 
other fly triangles. 

• An (n^) time approximate algorithm which guarantees — 5- 

(D) Maximum Weight Drawing of Caterpillar Graph 

• eaterpillar is a tree such that all internal nodes connect to at most 2 non-leaf 
nodes. 

• 0(n) time algorithm. 

(E) Forbidden Graphs for Maximum Weight Drawing 

• A graph G is outerplanar if it has a planar embedding such that all its 
nodes lie on a single face; an outerplanar graph is maximal if no edge can be 
added to the planar embedding without crossing. 

• If G{V,E) is a maximal outerplanar graph containing a simple cycle G 
with four nonconsecutive nodes which form two triangles sharing a common 
edge, then G cannot have a maximum weight drawing. 

Concluding Remarks 

• Does MaxWT(P) can be found in o(n^) time? 

• Does every maxmal planar graph admit a maximum weight drawing? 
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There are two kinds of approaches for orthogonal graph drawing: one supports 
planar and almost planar graphs (Giotto 0, Kandinsky) and is based on a min- 
cost flow algorithm. The second does not take planarity into account. It is there- 
fore conceptually much simpler and runs in linear-time. Representatives of such 
concepts are the papers In a variant of this last paper, Biedl/Kaufmann 

^ achieve the theoretically best area bound. The idea is the following: 

The graph is drawn such that each edge bends exactly once. Outgoing edges 
leave the vertex from the right or left side, incoming edges arrive at the top or 
bottom side. Edges are directed in a quasi- s-t-ordering such that each vertex gets 
at least one incoming and one outgoing edge. Then we assign disjoint rows to the 
vertices such that the number of outgoing edges leaving the vertex v to different 
directions is nearly balanced. The same is done for the column assignment where 
the numbers of incoming edges from different sides has to be balanced. Applying 
this simple scheme yields an area bound of (m-|-n)/2 x (m-|-n)/2 for the graph. 

Making graph algorithms aware of constraints is a very important task. We do 
it here for the Biedl/Kaufmann scheme. 

First we consider the behaviour of the algorithm in respect to the aspect ratio 
of the drawn nodes. With the original algorithm the aspect ratio of v can get as 
bad as 2 : deg{v) because the number of in-and outgoing edges for each vertex 
is not balanced. In the orginal paper the authors already gave an idea how 
to achieve an aspect ratio of 1:2, based on Eulerian paths. Unfortunately, the 
nice balancing of the edges is given up, such that the area bound is doubled. We 
improve this scheme so that it works very satisfying in practice. 

On the theoretical side, we give improvements for some special cases, namely 
for graphs with degree at most 3 and for planar graphs. We conjecture that an 
improvement to (2/3 • m -I- n/2) x (2/3 • m -I- n/2) of the area is possible using 
the Eulerian path technique in an appropriate way. 

In practice a flexible approach based on edge flipping heuristics provides even 
better aspect ratios and keeps the size of drawing area close to the good bounds 
of the unconstraint algorithm. 

Next, we look at edge constraints, where the edges have to be directed in a 
certain direction. There is a straightforward approach already published in 
Here the row and column assignment in the basic scheme is done such that the 
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Fig. 1. Kio drawn without and with aspect ratio constraints. For the drawing 
on the left an edge flipping heuristic was applied. 

constraints for the edge directions are fulfilled. But again we lose the balancing of 
the edges so that the area bound is even quadrupled. We propose an alternative 
approach that works especially good for one-dimensional constraints. 





Fig. 2. Two downward drawings of the same graph. On the left drawn with a 
simple algorithm and on the right with our linear-time constraint algorithm. 
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1 Introduction 

Given a simple polygon P, a covering triangulation is another triangulation 
over the vertices of P and some inner Steiner points (see Fig 1 for a covering 
triangulation generated by our heuristic). In other words, when computing a 
covering triangulation one is only allowed to add Steiner points in the interior 
of P. This problem is originally from mesh smoothing: one is not happy with 
the mesh over a specific region (say P) and would like to re-triangulate that 
region. Certainly, adding Steiner points on the boundary of P would destroy the 
neighboring part of P and would result in further changes of the mesh. 




Fig. 1. A covering triangulation built by our heuristic algorithm, Steiner points 
are round. 

When computing a covering triangulation of P, we usually want to make sure 
that the triangles have certain good property (and in fact this results in aesthet- 
ically beautiful covering triangulation). Among these properties, minmax angle, 
maxmin angle have received much attention. Mitchell obtained good approxima- 
tion algorithms for optimizing both of these qualities IMTrnirWTFFTI . However, 
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for another important covering triangulation, namingly the one maximizing the 
minimum height, very little is known. 

In this report we briefly mention some facts we already know about the 
maxmin height covering triangulation. Then we introduce a heuristic for gener- 
ating a covering triangulation which seems to have good empirical results. 



2 Results 

First of all, we briefly introduce something we know about maxmin height cov- 
ering triangulation. Because of the space constraint, we do not illustrate any of 
the proofs. 

Theorem 1. Given a simple polygon, the smaller of the minimum vertex/edge 
distance and the minimum edge length is the upper hound of the optimal height 
in the maxmin height covering triangulation. 



Theorem 2. Given a simple polygon, the minimum length stable diagonal is the 
upper bound of the optimal height in the maxmin height covering triangulation. 

Here the vertex/edge distance between an edge e and a vertex u of P is the 
vertical distance between v and vi where vvi is vertical to e and vi lies in the 
interior of e; moreover, vvi does not intersect P. A stable diagonal of P is the 
diagonal of P which appears in any triangulation of P. Clearly a convex polygon 
has no stable diagonal and a non-convex 4-gon has exactly one stable diagonal. 




Fig. 2. An illustration of the heuristic algorithm. 



Unfortunately the above facts do not immediately lead to any good approx- 
imate (or exact) solution. Nevertheless, we present a very simple heuristic to 
compute and draw a high quality covering triangulation. The idea is to try to 
cut an ear off at each stage. We note that when cutting an ear off (i.e., by adding 
a diagonal uv) this diagonal might lead to very small height in the ear. If this 
happens, then we will add an interior Steiner point s to increase the local triangle 
heights and to decompose the polygon into two pieces (to the left of the wedge 
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usw and to the right of the wedge vsw in Fig 2) which allow us to recursively 
apply this heuristic. (There are several such cases and here we just illustrate one 
of them, because of the space constraint.) However, we cannot prove formally 
that this heuristic lead to a good approximation solution as after each step the 
upper bound we illustrate in Theorem 1 and 2 changes. We run our heuristic 
over many manually input polygons, the empirical results seem to be promising 
(see Fig 1 for an example). Clearly this heuristic runs in 0{n^) time theoretically 
if we employ a linear time algorithm to slice off an ear lEEinni. 

Closing Remarks. We note that if the Steiner points are allowed to be on 
the boundary of P, then constant factor approximate solution exists EHMI- 
We raise several problems to conclude this report. (1) Can we design a good 
approximate algorithm for this problem? (2) What can we say theoretically 
about our simple heuristic? 
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